Cooperative Sensing and Heterogeneous Information Fusion in VCPS: A Multi-Agent Deep Reinforcement Learning Approach

Cited by: 1
Authors
Xu, Xincao [1]
Liu, Kai [2]
Dai, Penglin [3, 4]
Xie, Ruitao [5]
Cao, Jingjing [6]
Luo, Jiangtao [7]
Affiliations
[1] Univ Elect Sci & Technol China, Shenzhen Inst Adv Study, Shenzhen 518110, Peoples R China
[2] Chongqing Univ, Coll Comp Sci, Chongqing 400040, Peoples R China
[3] Southwest Jiaotong Univ, Sch Comp & Artificial Intelligence, Chengdu 611756, Peoples R China
[4] Natl Engn Lab Integrated Transportat Big Data Appl, Chengdu 611756, Peoples R China
[5] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518060, Peoples R China
[6] Wuhan Univ Technol, Sch Transportat & Logist Engn, Wuhan 430063, Hubei, Peoples R China
[7] Chongqing Univ Posts & Telecommun, Elect Informat & Networking Res Inst, Chongqing 400065, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Vehicular cyber-physical system; edge computing; cooperative sensing; heterogeneous information fusion; multi-agent deep reinforcement learning; data dissemination
DOI
10.1109/TITS.2023.3340334
Chinese Library Classification number
TU [Building Science]
Subject classification code
0813
Abstract
Cooperative sensing and heterogeneous information fusion are critical to realizing vehicular cyber-physical systems (VCPSs). This paper makes the first attempt to quantitatively measure the quality of VCPS by designing a new metric called Age of View (AoV). Specifically, we first present the system architecture, in which heterogeneous information is cooperatively sensed and uploaded via vehicle-to-infrastructure (V2I) communications in vehicular edge computing (VEC). Logical views are constructed by fusing the heterogeneous information at edge nodes. Further, we formulate the problem by deriving a cooperative sensing model based on the multi-class M/G/1 priority queue and by defining the AoV through modeling the timeliness, completeness and consistency of the logical views. On this basis, a multi-agent difference reward based actor-critic with V2I bandwidth allocation (MDRAC-VBA) solution is proposed. In particular, the system state includes vehicle-sensed information, edge-cached information and view requirements. The vehicle action space consists of the sensing frequencies and uploading priorities of information. A difference-reward-based credit assignment is designed to divide the system reward, which is defined as the VCPS quality, into difference rewards for individual vehicles. The edge node allocates V2I bandwidth to vehicles based on predicted vehicle trajectories and view requirements. Finally, we build the simulation model and give a comprehensive performance evaluation, which demonstrates the superiority of MDRAC-VBA.
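The difference-reward credit assignment mentioned in the abstract can be illustrated with a minimal sketch. It uses the generic textbook form of a difference reward (the system reward minus the system reward obtained when one agent's action is replaced by a default action), which may differ from the paper's exact formulation. The names `difference_rewards`, `toy_quality` and `default_action`, and the toy reward itself, are hypothetical and are not taken from the paper.

```python
from typing import Callable, List, Sequence


def difference_rewards(
    actions: Sequence[float],
    system_reward: Callable[[Sequence[float]], float],
    default_action: float,
) -> List[float]:
    """Generic difference reward: D_i = G(a) - G(a with a_i replaced by a default)."""
    full = system_reward(actions)
    rewards = []
    for i in range(len(actions)):
        # Counterfactual joint action: agent i falls back to the default action.
        counterfactual = list(actions)
        counterfactual[i] = default_action
        rewards.append(full - system_reward(counterfactual))
    return rewards


def toy_quality(actions: Sequence[float]) -> float:
    """Hypothetical stand-in for the system reward: total sensing effort, capped at 1."""
    return min(sum(actions), 1.0)


# Three hypothetical vehicles with scalar "sensing effort" actions.
per_vehicle = difference_rewards([0.5, 0.3, 0.4], toy_quality, default_action=0.0)
```

Each entry of `per_vehicle` estimates how much the shared reward would drop if that vehicle alone reverted to the default action, which is the intuition behind dividing one system-level reward among cooperating agents.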
Pages: 4876-4891
Page count: 16