Multi-user reinforcement learning based multi-reward for spectrum access in cognitive vehicular networks

Cited by: 1
Authors
Chen, Lingling [1 ,2 ]
Zhao, Quanjun [1 ]
Fu, Ke [1 ]
Zhao, Xiaohui [3 ]
Sun, Hongliang [1 ,4 ]
Affiliations
[1] Jilin Inst Chem Technol, Coll Informat & Control Engn, Jilin 132000, Peoples R China
[2] Jilin Univ, Coll Commun Engn, Changchun 130012, Peoples R China
[3] Jilin Univ, Coll Commun Engn, Key Lab Informat Sci, Changchun 130012, Peoples R China
[4] Jilin Univ, Dept Commun Engn, Changchun 130012, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Cognitive Vehicular Networks; Dynamic spectrum access; Vehicle-to-vehicle links; Vehicle-to-infrastructure links; Spectrum sensing errors; DYNAMIC MULTICHANNEL ACCESS; COMMUNICATION; PROTOCOL; VANETS; RADIOS;
DOI
10.1007/s11235-023-01004-6
Chinese Library Classification number
TN [Electronic technology, communication technology]
Discipline classification code
0809
Abstract
Cognitive Vehicular Networks (CVNs) can improve spectrum utilization by intelligently using idle spectrum to meet communication needs. Previous research considered only vehicle-to-vehicle (V2V) links or only vehicle-to-infrastructure (V2I) links, and ignored the influence of spectrum sensing errors. In this paper, V2V and V2I links are therefore considered simultaneously, in the presence of spectrum sensing errors, within the CVN communication environment that we establish, and a dynamic spectrum access problem aimed at spectrum utilization is formulated. To solve this problem, a reinforcement learning method is introduced. However, the reinforcement learning method neglects the impact of two kinds of collisions on the spectrum access rate of cognitive vehicles: collisions between cognitive vehicles, and collisions between cognitive vehicles and primary vehicles. Hence, different reward functions are designed for the different collision situations, and an improved reinforcement learning method is used to raise the success probability of spectrum access. Comparisons in Python with the Myopic method, DQN, and traditional DDQN show that the proposed method performs and converges significantly better than these methods.
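The idea of assigning a different reward to each collision situation can be illustrated with a minimal Python sketch. It is not the paper's reward design: the outcome names, the reward values, the rule that a primary-vehicle collision takes precedence, and the function names `classify_access` and `reward` are all assumptions chosen for illustration, and spectrum sensing errors are not modeled.

```python
from enum import Enum
from typing import Optional, Sequence


class Outcome(Enum):
    NO_ACCESS = "no_access"        # the cognitive vehicle stays silent
    SUCCESS = "success"            # channel is idle and nobody else uses it
    CV_COLLISION = "cv_collision"  # another cognitive vehicle picked the same channel
    PV_COLLISION = "pv_collision"  # the channel is occupied by a primary vehicle


# Hypothetical reward values; the abstract does not state the actual ones.
REWARDS = {
    Outcome.NO_ACCESS: 0.0,
    Outcome.SUCCESS: 1.0,
    Outcome.CV_COLLISION: -1.0,
    Outcome.PV_COLLISION: -2.0,
}


def classify_access(
    channel: Optional[int],
    other_cv_channels: Sequence[Optional[int]],
    primary_busy: Sequence[bool],
) -> Outcome:
    """Classify one access attempt of a single cognitive vehicle."""
    if channel is None:
        return Outcome.NO_ACCESS
    # Assumption: interfering with a primary vehicle is checked first.
    if primary_busy[channel]:
        return Outcome.PV_COLLISION
    if channel in other_cv_channels:
        return Outcome.CV_COLLISION
    return Outcome.SUCCESS


def reward(
    channel: Optional[int],
    other_cv_channels: Sequence[Optional[int]],
    primary_busy: Sequence[bool],
) -> float:
    """Map the access outcome to its (assumed) scalar reward."""
    return REWARDS[classify_access(channel, other_cv_channels, primary_busy)]
```

In a multi-agent training loop, each cognitive vehicle would call `reward` with its own channel choice and the choices of the other vehicles in that time slot, so that the two collision types produce distinct learning signals.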
Pages: 51-65
Page count: 15
Source
Telecommunication Systems, 2023, 83: 51-65