Simulated annealing with reinforcement learning for the set team orienteering problem with time windows

被引：2

作者：

Yu, Vincent F. ^{[1
,2
]}

Salsabila, Nabila Yuraisyah ^{[1
]}

Lin, Shih-Wei ^{[3
,4
,5
]}

Gunawan, Aldy ^{[6
]}

机构：

[1] Natl Taiwan Univ Sci & Technol, Dept Ind Management, Taipei, Taiwan

[2] Natl Taiwan Univ Sci & Technol, Ctr Cyber Phys Syst Innovat, Taipei, Taiwan

[3] Chang Gung Univ, Dept Informat Management, Taoyuan, Taiwan

[4] Ming Chi Univ Technol, Dept Ind Engn & Management, Taipei, Taiwan

[5] Keelung Chang Gung Mem Hosp, Dept Emergency Med, Keelung City, Taiwan

[6] Singapore Management Univ, Sch Comp & Informat Syst, Singapore, Singapore

来源：

EXPERT SYSTEMS WITH APPLICATIONS | 2024年 / 238卷

关键词：

Team orienteering problem with time windows; Set orienteering problem; Simulated annealing; LOCAL SEARCH; HEURISTICS; ALGORITHMS;

D O I：

10.1016/j.eswa.2023.121996

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This research investigates the Set Team Orienteering Problem with Time Windows (STOPTW), a new variant of the well-known Team Orienteering Problem with Time Windows and Set Orienteering Problem. In the STOPTW, customers are grouped into clusters. Each cluster is associated with a profit attainable when a customer in the cluster is visited within the customer's time window. A Mixed Integer Linear Programming model is formulated for STOPTW to maximizing total profit while adhering to time window constraints. Since STOPTW is an NP-hard problem, a Simulated Annealing with Reinforcement Learning (SARL) algorithm is developed. The proposed SARL incorporates the core concepts of reinforcement learning, utilizing the epsilon-greedy algorithm to learn the fitness values resulting from neighborhood moves. Numerical experiments are conducted to assess the performance of SARL, comparing the results with those obtained by CPLEX and Simulated Annealing (SA). For small instances, both SARL and SA algorithms outperform CPLEX by obtaining eight optimal solutions and 12 better solutions. For large instances, both algorithms obtain better solutions to 28 out of 29 instances within shorter computational times compared to CPLEX. Overall, SARL outperforms SA by resulting in lower gap percentages within the same computational times. Specifically, SARL outperforms SA in solving 13 large STOPTW benchmark instances. Finally, a sensitivity analysis is conducted to derive managerial insights.

引用

页数：15

共 50 条

[1] A simulated annealing heuristic for the team orienteering problem with time windows
Lin, Shih-Wei
Yu, Vincent F.
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2012, 217 (01) : 94 - 107
[2] Set Team Orienteering Problem with Time Windows
Gunawan, Aldy
Yu, Vincent F.
Sutanto, Andro Nicus
Jodiawan, Panca
LEARNING AND INTELLIGENT OPTIMIZATION, LION 15, 2021, 12931 : 142 - 149
[3] A simulated annealing heuristic for the multiconstraint team orienteering problem with multiple time windows
Lin, Shih-Wei
Yu, Vincent F.
APPLIED SOFT COMPUTING, 2015, 37 : 632 - 642
[4] A reinforcement learning approach to the orienteering problem with time windows
Gama, Ricardo
Fernandes, Hugo L.
COMPUTERS & OPERATIONS RESEARCH, 2021, 133
[5] Solving the team orienteering problem with time windows and mandatory visits by multi-start simulated annealing
Lin, Shih-Wei
Yu, Vincent F.
COMPUTERS & INDUSTRIAL ENGINEERING, 2017, 114 : 195 - 205
[6] Team Orienteering Problem with Time Windows and Variable Profit
Marzal, Eliseo
Sebastia, Laura
PROCEEDINGS OF THE 2022 17TH CONFERENCE ON COMPUTER SCIENCE AND INTELLIGENCE SYSTEMS (FEDCSIS), 2022, : 347 - 350
[7] The Multiconstraint Team Orienteering Problem with Multiple Time Windows
Souffriau, Wouter
Vansteenwegen, Pieter
Vanden Berghe, Greet
Van Oudheusden, Dirk
TRANSPORTATION SCIENCE, 2013, 47 (01) : 53 - 63
[8] A Composite Algorithm for the Team Orienteering Problem with Time Windows
Konstantopoulos, Charalampos
Orfanos, Dimitrios
2016 7TH INTERNATIONAL CONFERENCE ON INFORMATION, INTELLIGENCE, SYSTEMS & APPLICATIONS (IISA), 2016,
[9] Team orienteering problem with time windows and time-dependent scores
Yu, Vincent F.
Jewpanya, Parida
Lin, Shih-Wei
Redi, A. A. N. Perwira
COMPUTERS & INDUSTRIAL ENGINEERING, 2019, 127 : 213 - 224
[10] Iterated local search for the team orienteering problem with time windows
Vansteenwegen, Pieter
Souffriau, Wouter
Vanden Berghe, Greet
Van Oudheusden, Dirk
COMPUTERS & OPERATIONS RESEARCH, 2009, 36 (12) : 3281 - 3290

← 1 2 3 4 5 →