Fair collaborative vehicle routing: A deep multi-agent reinforcement learning approach

被引:0
|
作者
Mak, Stephen [1 ,4 ]
Xu, Liming [1 ]
Pearce, Tim [2 ,5 ]
Ostroumov, Michael [3 ]
Brintrup, Alexandra [1 ]
机构
[1] Univ Cambridge, Inst Mfg, Dept Engn, Cambridge, England
[2] Microsoft Res Cambridge, Cambridge, England
[3] Value Chain Lab, London, England
[4] 17 Charles Babbage Rd, Cambridge CB3 0FS, England
[5] Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R China
基金
英国工程与自然科学研究理事会;
关键词
Collaborative vehicle routing; Deep multi-agent reinforcement learning; Negotiation; Gain sharing; Multi-agent systems; Machine learning; HORIZONTAL COOPERATION; ALLOCATION; LEVEL; COST; GAME;
D O I
10.1016/j.trc.2023.104376
中图分类号
U [交通运输];
学科分类号
08 ; 0823 ;
摘要
Collaborative vehicle routing occurs when carriers collaborate through sharing their transporta-tion requests and performing transportation requests on behalf of each other. This achieves economies of scale, thus reducing cost, greenhouse gas emissions and road congestion. But which carrier should partner with whom, and how much should each carrier be compensated? Traditional game theoretic solution concepts are expensive to calculate as the characteristic function scales exponentially with the number of agents. This would require solving the vehicle routing problem (NP-hard) an exponential number of times. We therefore propose to model this problem as a coalitional bargaining game solved using deep multi-agent reinforcement learning, where - crucially - agents are not given access to the characteristic function. Instead, we implicitly reason about the characteristic function; thus, when deployed in production, we only need to evaluate the expensive post-collaboration vehicle routing problem once. Our contribution is that we are the first to consider both the route allocation problem and gain sharing problem simultaneously - without access to the expensive characteristic function. Through decentralised machine learning, our agents bargain with each other and agree to outcomes that correlate well with the Shapley value - a fair profit allocation mechanism. Importantly, we are able to achieve a reduction in run-time of 88%.
引用
收藏
页数:25
相关论文
共 50 条
  • [21] A multi-agent deep reinforcement learning approach for traffic signal coordination
    Hu, Ta-Yin
    Li, Zhuo-Yu
    IET INTELLIGENT TRANSPORT SYSTEMS, 2024, 18 (08) : 1428 - 1444
  • [22] A deep reinforcement learning approach for multi-agent mobile robot patrolling
    Jana, Meghdeep
    Vachhani, Leena
    Sinha, Arpita
    INTERNATIONAL JOURNAL OF INTELLIGENT ROBOTICS AND APPLICATIONS, 2022, 6 (04) : 724 - 745
  • [23] A reinforcement learning approach for developing routing policies in multi-agent production scheduling
    Wang, Yi-Chi
    Usher, John M.
    INTERNATIONAL JOURNAL OF ADVANCED MANUFACTURING TECHNOLOGY, 2007, 33 (3-4): : 323 - 333
  • [24] A deep reinforcement learning approach for multi-agent mobile robot patrolling
    Meghdeep Jana
    Leena Vachhani
    Arpita Sinha
    International Journal of Intelligent Robotics and Applications, 2022, 6 : 724 - 745
  • [25] Load Frequency Control: A Deep Multi-Agent Reinforcement Learning Approach
    Rozada, Sergio
    Apostolopoulou, Dimitra
    Alonso, Eduardo
    2020 IEEE POWER & ENERGY SOCIETY GENERAL MEETING (PESGM), 2020,
  • [26] Avoiding collaborative paradox in multi-agent reinforcement learning
    Kim, Hyunseok
    Kim, Seonghyun
    Lee, Donghun
    Jang, Ingook
    ETRI JOURNAL, 2021, 43 (06) : 1004 - 1012
  • [27] An Incremental Approach for Multi-Agent Deep Reinforcement Learning for Multicriteria Missions
    Cysne, Nicholas Scharan
    Ribeiro, Carlos Henrique Costa
    Ghedini, Cinara Guellner
    2023 EUROPEAN CONTROL CONFERENCE, ECC, 2023,
  • [28] Federated Multi-Agent Deep Reinforcement Learning for Resource Allocation of Vehicle-to-Vehicle Communications
    Li, Xiang
    Lu, Lingyun
    Ni, Wei
    Jamalipour, Abbas
    Zhang, Dalin
    Du, Haifeng
    IEEE Transactions on Vehicular Technology, 2022, 71 (08): : 8810 - 8824
  • [29] Federated Multi-Agent Deep Reinforcement Learning for Resource Allocation of Vehicle-to-Vehicle Communications
    Li, Xiang
    Lu, Lingyun
    Ni, Wei
    Jamalipour, Abbas
    Zhang, Dalin
    Du, Haifeng
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2022, 71 (08) : 8810 - 8824
  • [30] MAGNet: Multi-agent Graph Network for Deep Multi-agent Reinforcement Learning
    Malysheva, Aleksandra
    Kudenko, Daniel
    Shpilman, Aleksei
    2019 XVI INTERNATIONAL SYMPOSIUM PROBLEMS OF REDUNDANCY IN INFORMATION AND CONTROL SYSTEMS (REDUNDANCY), 2019, : 171 - 176