Multi-objective reinforcement learning for bi-objective time-dependent pickup and delivery problem with late penalties

被引:2
|
作者
Santiyuda, Gemilang [1 ]
Wardoyo, Retantyo [1 ]
Pulungan, Reza [1 ]
Yu, Vincent F. [2 ]
机构
[1] Univ Gadjah Mada, Fac Math & Nat Sci, Dept Comp Sci & Elect, Yogyakarta, Indonesia
[2] Natl Taiwan Univ Sci & Technol, Dept Ind Management, Taipei, Taiwan
关键词
Pickup and delivery problem; Multi-objective; Deep reinforcement learning; Attention mechanism; Hypernetwork; VEHICLE-ROUTING PROBLEM; OPTIMIZATION; EVOLUTIONARY; ALGORITHM; REPRESENTATIONS; NETWORK; MOEA/D; BRANCH; PRICE;
D O I
10.1016/j.engappai.2023.107381
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This study addresses the bi-objective time-dependent pickup and delivery problem with late penalties (TDPDPLP). Incorporating time-dependent travel time into the problem formulation to model traffic congestion is critical, especially for problems with time-related costs, to decrease the difference in the projected quality of solutions when applying optimization methods in the real world. This study proposes a multi-objective reinforcement learning (MORL)-based method with hypernetwork and heterogeneous attention mechanism (HAM) with a two-stage training scheme to solve the bi-objective TDPDPLP. The proposed method can instantly generate an approximation of the Pareto optimal front (POF) after offline training. The conducted ablation study also showed that discarding coordinates from the features simplifies the model and saves several hours of training while improving the quality of the solutions. The performance of the trained model is evaluated on various instances, including real-world-based instances from Barcelona, Berlin, and Porto Alegre. The performance of the proposed method is evaluated based on the hypervolume (HV) and additive epsilon (epsilon+) of the generated POF. We compare the performance of the proposed method to another MORL method, namely the preference-conditioned multi-objective combinatorial optimization (PMOCO) and several well-known multiobjective evolutionary algorithms (MOEAs). Experiments showed that the proposed method performs better than PMOCO and the employed MOEAs on various problem instances. The trained method only needs minutes to generate a POF approximation, while the MOEA(s) require hours. Furthermore, it also generalizes well on different characteristics of problem instances and performs well on instances from cities other than the city in the training instances.
引用
收藏
页数:29
相关论文
共 50 条
  • [31] A Hybrid Particle Swarm Optimization Algorithm for Multi-Objective Pickup and Delivery Problem with Time Windows
    Zou, Shuilong
    Li, Jin
    Li, Xueqian
    JOURNAL OF COMPUTERS, 2013, 8 (10) : 2583 - 2589
  • [32] Automatic Generation of Multi-objective ACO Algorithms for the Bi-objective Knapsack
    Bezerra, Leonardo C. T
    Lopez-Ibanez, Manuel
    Stutzle, Thomas
    SWARM INTELLIGENCE (ANTS 2012), 2012, 7461 : 37 - 48
  • [33] Bi-objective resource-constrained project scheduling problem with time-dependent resource costs
    Alcaraz, Javier
    Anton-Sanchez, Laura
    Saldanha-da-Gama, Francisco
    JOURNAL OF MANUFACTURING SYSTEMS, 2022, 63 : 506 - 523
  • [34] Bi-objective Inventory Management through Evolutionary Multi-objective Optimization
    Tsou, China-Shih
    Wu, Bo-Han
    Lee, Yina-Hao
    ECONOMICS, BUSINESS AND MANAGEMENT, 2011, 2 : 93 - 97
  • [35] Solving multi-objective optimization problems by a bi-objective evolutionary algorithm
    Wang, Yu-Ping
    PROCEEDINGS OF 2007 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2007, : 1018 - 1024
  • [36] A Multi-Objective Water Cycle Algorithm for the BI-Objective Multi-Mode Project Resource Renting Problem
    Mahmoodi, Samin
    Najafi, Amir Abbas
    Ghods, Kian
    ECONOMIC COMPUTATION AND ECONOMIC CYBERNETICS STUDIES AND RESEARCH, 2024, 58 (01): : 104 - 119
  • [37] An Efficient Multi-objective Evolutionary Algorithm for a Practical Dynamic Pickup and Delivery Problem
    Cai, Junchuang
    Zhu, Qingling
    Lin, Qiuzhen
    Li, Jianqiang
    Chen, Jianyong
    Ming, Zhong
    INTELLIGENT COMPUTING THEORIES AND APPLICATION (ICIC 2022), PT I, 2022, 13393 : 27 - 40
  • [38] Multi-objective pickup point location problem considering impact of home delivery
    Chen Y.
    Han X.
    Zeng Q.
    Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2016, 22 (11): : 2679 - 2690
  • [39] Efficient meta-heuristics for the Multi-Objective Time-Dependent Orienteering Problem
    Mei, Yi
    Salim, Flora D.
    Li, Xiaodong
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2016, 254 (02) : 443 - 457
  • [40] Multi-objective metaheuristics for a location-routing problem with simultaneous pickup and delivery
    Wang, Xuefeng
    2013 SIXTH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID), VOL 2, 2013, : 335 - 338