Multi-Agent Decision-Making Modes in Uncertain Interactive Traffic Scenarios via Graph Convolution-Based Deep Reinforcement Learning

Cited by: 12
Authors
Gao, Xin [1]
Li, Xueyuan [1]
Liu, Qi [1]
Li, Zirui [1,2]
Yang, Fan [1]
Luan, Tian [1]
Affiliations
[1] Beijing Inst Technol, Sch Mech Engn, Beijing 100080, Peoples R China
[2] Delft Univ Technol, Fac Civil Engn & Geosci, Dept Transport & Planning, Stevinweg 1, NL-2628 CN Delft, Netherlands
Keywords
multi-mode decision-making; connected autonomous vehicles; reward function matrix; uncertain highway exit scene; GQN; MDGQN; PLANNING PROCESS; AUTONOMOUS VANS
DOI
10.3390/s22124586
Chinese Library Classification (CLC) number
O65 [Analytical Chemistry]
Subject classification code
070302; 081704
Abstract
The reward function is one of the main elements of reinforcement learning, yet its design often receives too little attention in concrete applications, which leads to unsatisfactory performance. In this study, a reward function matrix is proposed for training various decision-making modes, with emphasis on decision-making styles and further emphasis on incentives and punishments. Additionally, we model the traffic scene as a graph to better represent the interaction between vehicles, and adopt a graph convolutional network (GCN) to extract features of the graph structure so that the connected autonomous vehicles can make decisions directly. Furthermore, we combine the GCN with deep Q-learning and with multi-step double deep Q-learning to train four decision-making modes; the resulting methods are named the graph convolutional deep Q-network (GQN) and the multi-step double graph convolutional deep Q-network (MDGQN). In simulation, the advantage of the reward function matrix is demonstrated by comparison with a baseline, and evaluation metrics are proposed to verify the performance differences among decision-making modes. Results show that, by adjusting the weight values in the reward function matrix, the trained decision-making modes can satisfy various driving requirements, including task completion rate, safety, comfort level, and completion efficiency. Finally, the decision-making modes trained by MDGQN performed better in an uncertain highway exit scene than those trained by GQN.
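The two building blocks named in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: it assumes the standard symmetric-normalized GCN layer and the usual n-step double Q-learning target, and the function names `normalize_adjacency`, `gcn_layer`, and `multi_step_double_q_target` are hypothetical. The reward function matrix is not modeled here because the abstract does not give its structure.

```python
import numpy as np


def normalize_adjacency(adj):
    """Return D^-1/2 (A + I) D^-1/2, the normalization of a standard GCN layer."""
    a_hat = adj + np.eye(adj.shape[0])      # add self-loops
    deg = a_hat.sum(axis=1)                 # node degrees (always >= 1)
    d_inv_sqrt = 1.0 / np.sqrt(deg)
    return a_hat * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]


def gcn_layer(adj, features, weight):
    """One graph convolution over vehicle nodes: ReLU(A_norm @ X @ W)."""
    return np.maximum(normalize_adjacency(adj) @ features @ weight, 0.0)


def multi_step_double_q_target(rewards, gamma, q_online_next, q_target_next, done):
    """n-step double Q-learning target (assumed form, not taken from the paper).

    rewards:        the n rewards r_t, ..., r_{t+n-1}
    q_online_next:  online-network Q-values at state s_{t+n}
    q_target_next:  target-network Q-values at state s_{t+n}
    done:           True if the episode ended within the n steps
    """
    n = len(rewards)
    n_step_return = sum(gamma ** k * r for k, r in enumerate(rewards))
    if done:
        return n_step_return
    # Double Q-learning: the online net selects the action, the target net evaluates it.
    best_action = int(np.argmax(q_online_next))
    return n_step_return + gamma ** n * q_target_next[best_action]
```

In this sketch, each row of `features` would hold one vehicle's state and `adj` would mark which vehicles interact; a Q-network head on top of `gcn_layer` would supply the Q-values passed to `multi_step_double_q_target`.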
Pages: 18
Related papers
50 in total
  • [11] MAGNet: Multi-agent Graph Network for Deep Multi-agent Reinforcement Learning
    Malysheva, Aleksandra
    Kudenko, Daniel
    Shpilman, Aleksei
    2019 XVI INTERNATIONAL SYMPOSIUM PROBLEMS OF REDUNDANCY IN INFORMATION AND CONTROL SYSTEMS (REDUNDANCY), 2019, : 171 - 176
  • [12] MO-MIX: Multi-Objective Multi-Agent Cooperative Decision-Making With Deep Reinforcement Learning
    Hu, Tianmeng
    Luo, Biao
    Yang, Chunhua
    Huang, Tingwen
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (10) : 12098 - 12112
  • [13] GLOBAL-LOCALIZED AGENT GRAPH CONVOLUTION FOR MULTI-AGENT REINFORCEMENT LEARNING
    Liu, Yuntao
    Dou, Yong
    Shen, Siqi
    Qiao, Peng
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 3480 - 3484
  • [14] Multi-agent Decision-making at Unsignalized Intersections with Reinforcement Learning from Demonstrations
    Huang, Chang
    Zhao, Junqiao
    Zhou, Hongtu
    Zhang, Hai
    Zhang, Xiao
    Ye, Chen
    2023 IEEE INTELLIGENT VEHICLES SYMPOSIUM, IV, 2023,
  • [15] LLM-guided decision-making toolkit for multi-agent reinforcement learning
    Li, Zhemin
    Zhang, Ruobing
    Wang, Zhengming
    Xie, Zheng
    Song, Yiping
    NEUROCOMPUTING, 2025, 638
  • [16] Nash double Q-based multi-agent deep reinforcement learning for interactive in mixed traffic
    Li, Lin
    Zhao, Wanzhong
    Wang, Chunyan
    Fotouhi, Abbas
    Liu, Xuze
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 237
  • [17] Multi-agent deep reinforcement learning-based autonomous decision-making framework for community virtual power plants
    Li, Xiangyu
    Luo, Fengji
    Li, Chaojie
    APPLIED ENERGY, 2024, 360
  • [18] Deep Reinforcement Learning for Ecological and Distributed Urban Traffic Signal Control with Multi-Agent Equilibrium Decision Making
    Yan, Liping
    Wang, Jing
    ELECTRONICS, 2024, 13 (10)
  • [19] Graph Convolution Reinforcement Learning for Decision-Making in Highway Overtaking Scenario
    Meng Xiaoqiang
    Yang Fan
    Li Xueyuan
    Liu Qi
    Gao Xin
    Li Zirui
    2022 IEEE 17TH CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA), 2022, : 417 - 422
  • [20] Research of Decision-making in the Multi-Agent System Based on Interactive Influence Diagrams
    Li, Bo
    Luo, Jian
    Zhuang, Jinfa
    MATERIALS, MECHATRONICS AND AUTOMATION, PTS 1-3, 2011, 467-469 : 1947 - +