A Learnheuristic Algorithm Based on Thompson Sampling for the Heterogeneous and Dynamic Team Orienteering Problem

被引:0
|
作者
Uguina, Antonio R. [1 ]
Gomez, Juan F. [1 ]
Panadero, Javier [2 ]
Martinez-Gavara, Anna [3 ]
Juan, Angel A. [1 ]
机构
[1] Univ Politecn Valencia, Res Ctr Prod Management & Engn, Alcoy 03801, Spain
[2] Univ Autonoma Barcelona, Dept Comp Architecture & Operating Syst, Bellaterra 08193, Spain
[3] Univ Valencia, Stat & Operat Res Dept, Doctor Moliner 50, Burjassot 46100, Valencia, Spain
关键词
combinatorial optimization; team orienteering problem; reinforcement learning; learnheuristics; UNMANNED AERIAL VEHICLES; PATH RELINKING; DRONES; GRASP;
D O I
10.3390/math12111758
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
The team orienteering problem (TOP) is a well-studied optimization challenge in the field of Operations Research, where multiple vehicles aim to maximize the total collected rewards within a given time limit by visiting a subset of nodes in a network. With the goal of including dynamic and uncertain conditions inherent in real-world transportation scenarios, we introduce a novel dynamic variant of the TOP that considers real-time changes in environmental conditions affecting reward acquisition at each node. Specifically, we model the dynamic nature of environmental factors-such as traffic congestion, weather conditions, and battery level of each vehicle-to reflect their impact on the probability of obtaining the reward when visiting each type of node in a heterogeneous network. To address this problem, a learnheuristic optimization framework is proposed. It combines a metaheuristic algorithm with Thompson sampling to make informed decisions in dynamic environments. Furthermore, we conduct empirical experiments to assess the impact of varying reward probabilities on resource allocation and route planning within the context of this dynamic TOP, where nodes might offer a different reward behavior depending upon the environmental conditions. Our numerical results indicate that the proposed learnheuristic algorithm outperforms static approaches, achieving up to 25% better performance in highly dynamic scenarios. Our findings highlight the effectiveness of our approach in adapting to dynamic conditions and optimizing decision-making processes in transportation systems.
引用
收藏
页数:19
相关论文
共 50 条
  • [41] Personalized Dynamic Pricing Based on Improved Thompson Sampling
    Bi, Wenjie
    Wang, Bing
    Liu, Haiying
    MATHEMATICS, 2024, 12 (08)
  • [42] Efficient Cluster-Based Heuristics for the Team Orienteering Problem with Time Windows
    Gavalas, Damianos
    Konstantopoulos, Charalampos
    Mastakas, Konstantinos
    Pantziou, Grammati
    ASIA-PACIFIC JOURNAL OF OPERATIONAL RESEARCH, 2019, 36 (01)
  • [43] Hybrid dynamic programming with bounding algorithm for the multi-profit orienteering problem
    Kim, Hyunjoon
    Kim, Byung-In
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2022, 303 (02) : 550 - 566
  • [44] A SAMPLING ALGORITHM BASED ON SUPERVISED LEARNING FOR A DYNAMIC TRAVELING SALESMAN PROBLEM
    Jiang, Jiahao
    Gao, Jian
    PROCEEDINGS OF THE ROMANIAN ACADEMY SERIES A-MATHEMATICS PHYSICS TECHNICAL SCIENCES INFORMATION SCIENCE, 2022, 23 (01): : 79 - 88
  • [45] A Two-Stage Resolution Search-Based Heuristic for the Team Orienteering Problem
    Hifi, Mhand
    Ibrahim, Moussa
    Saadi, Toufik
    2014 INTERNATIONAL CONFERENCE ON FUTURE INTERNET OF THINGS AND CLOUD (FICLOUD), 2014, : 442 - 447
  • [46] A cutting-plane algorithm for the Steiner team orienteering problem (vol 135, pg 922, 2019)
    Assuncao, Lucas
    Mateus, Geraldo Robson
    COMPUTERS & INDUSTRIAL ENGINEERING, 2020, 141
  • [47] A multi-objective evolutionary algorithm based on decomposition and constraint programming for the multi-objective team orienteering problem with time windows
    Hu, Wanzhe
    Fathi, Mahdi
    Pardalos, Panos M.
    APPLIED SOFT COMPUTING, 2018, 73 : 383 - 393
  • [48] The Team Orienteering Problem with Time Windows: An LP-based Granular Variable Neighborhood Search
    Labadie, Nacima
    Mansini, Renata
    Melechovsky, Jan
    Calvo, Roberto Wolfler
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2012, 220 (01) : 15 - 27
  • [49] A GRASP to solve the multi-constraints multi-modal team orienteering problem with time windows for groups with heterogeneous preferences
    Ruiz-Meza, Jose
    Brito, Julio
    Montoya-Torres, Jairo R.
    COMPUTERS & INDUSTRIAL ENGINEERING, 2021, 162
  • [50] Online nonparametric monitoring of heterogeneous data streams with partial observations based on Thompson sampling
    Ye, Honghan
    Xian, Xiaochen
    Cheng, Jing-Ru C.
    Hable, Brock
    Shannon, Robert W.
    Elyaderani, Mojtaba Kadkhodaie
    Liu, Kaibo
    IISE TRANSACTIONS, 2023, 55 (04) : 392 - 404