Fair collaborative vehicle routing: A deep multi-agent reinforcement learning approach

被引：0

作者：

Mak, Stephen ^{[1
,4
]}

Xu, Liming ^{[1
]}

Pearce, Tim ^{[2
,5
]}

Ostroumov, Michael ^{[3
]}

Brintrup, Alexandra ^{[1
]}

机构：

[1] Univ Cambridge, Inst Mfg, Dept Engn, Cambridge, England

[2] Microsoft Res Cambridge, Cambridge, England

[3] Value Chain Lab, London, England

[4] 17 Charles Babbage Rd, Cambridge CB3 0FS, England

[5] Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R China

来源：

TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES | 2023年 / 157卷

基金：

英国工程与自然科学研究理事会;

关键词：

Collaborative vehicle routing; Deep multi-agent reinforcement learning; Negotiation; Gain sharing; Multi-agent systems; Machine learning; HORIZONTAL COOPERATION; ALLOCATION; LEVEL; COST; GAME;

D O I：

10.1016/j.trc.2023.104376

中图分类号：

U [交通运输];

学科分类号：

08 ; 0823 ;

摘要：

Collaborative vehicle routing occurs when carriers collaborate through sharing their transporta-tion requests and performing transportation requests on behalf of each other. This achieves economies of scale, thus reducing cost, greenhouse gas emissions and road congestion. But which carrier should partner with whom, and how much should each carrier be compensated? Traditional game theoretic solution concepts are expensive to calculate as the characteristic function scales exponentially with the number of agents. This would require solving the vehicle routing problem (NP-hard) an exponential number of times. We therefore propose to model this problem as a coalitional bargaining game solved using deep multi-agent reinforcement learning, where - crucially - agents are not given access to the characteristic function. Instead, we implicitly reason about the characteristic function; thus, when deployed in production, we only need to evaluate the expensive post-collaboration vehicle routing problem once. Our contribution is that we are the first to consider both the route allocation problem and gain sharing problem simultaneously - without access to the expensive characteristic function. Through decentralised machine learning, our agents bargain with each other and agree to outcomes that correlate well with the Shapley value - a fair profit allocation mechanism. Importantly, we are able to achieve a reduction in run-time of 88%.

引用

页数：25

共 50 条

[21] A multi-agent deep reinforcement learning approach for traffic signal coordination
Hu, Ta-Yin
Li, Zhuo-Yu
IET INTELLIGENT TRANSPORT SYSTEMS, 2024, 18 (08) : 1428 - 1444
[22] A deep reinforcement learning approach for multi-agent mobile robot patrolling
Jana, Meghdeep
Vachhani, Leena
Sinha, Arpita
INTERNATIONAL JOURNAL OF INTELLIGENT ROBOTICS AND APPLICATIONS, 2022, 6 (04) : 724 - 745
[23] A reinforcement learning approach for developing routing policies in multi-agent production scheduling
Wang, Yi-Chi
Usher, John M.
INTERNATIONAL JOURNAL OF ADVANCED MANUFACTURING TECHNOLOGY, 2007, 33 (3-4): : 323 - 333
[24] A deep reinforcement learning approach for multi-agent mobile robot patrolling
Meghdeep Jana
Leena Vachhani
Arpita Sinha
International Journal of Intelligent Robotics and Applications, 2022, 6 : 724 - 745
[25] Load Frequency Control: A Deep Multi-Agent Reinforcement Learning Approach
Rozada, Sergio
Apostolopoulou, Dimitra
Alonso, Eduardo
2020 IEEE POWER & ENERGY SOCIETY GENERAL MEETING (PESGM), 2020,
[26] Avoiding collaborative paradox in multi-agent reinforcement learning
Kim, Hyunseok
Kim, Seonghyun
Lee, Donghun
Jang, Ingook
ETRI JOURNAL, 2021, 43 (06) : 1004 - 1012
[27] An Incremental Approach for Multi-Agent Deep Reinforcement Learning for Multicriteria Missions
Cysne, Nicholas Scharan
Ribeiro, Carlos Henrique Costa
Ghedini, Cinara Guellner
2023 EUROPEAN CONTROL CONFERENCE, ECC, 2023,
[28] Federated Multi-Agent Deep Reinforcement Learning for Resource Allocation of Vehicle-to-Vehicle Communications
Li, Xiang
Lu, Lingyun
Ni, Wei
Jamalipour, Abbas
Zhang, Dalin
Du, Haifeng
IEEE Transactions on Vehicular Technology, 2022, 71 (08): : 8810 - 8824
[29] Federated Multi-Agent Deep Reinforcement Learning for Resource Allocation of Vehicle-to-Vehicle Communications
Li, Xiang
Lu, Lingyun
Ni, Wei
Jamalipour, Abbas
Zhang, Dalin
Du, Haifeng
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2022, 71 (08) : 8810 - 8824
[30] MAGNet: Multi-agent Graph Network for Deep Multi-agent Reinforcement Learning
Malysheva, Aleksandra
Kudenko, Daniel
Shpilman, Aleksei
2019 XVI INTERNATIONAL SYMPOSIUM PROBLEMS OF REDUNDANCY IN INFORMATION AND CONTROL SYSTEMS (REDUNDANCY), 2019, : 171 - 176

← 1 2 3 4 5 →