Cooperative Multi-Agent Reinforcement-Learning-Based Distributed Dynamic Spectrum Access in Cognitive Radio Networks

被引:27
|
作者
Tan, Xiang [1 ]
Zhou, Li [1 ]
Wang, Haijun [1 ]
Sun, Yuli [1 ]
Zhao, Haitao [1 ]
Seet, Boon-Chong [2 ]
Wei, Jibo [1 ]
Leung, Victor C. M. [3 ,4 ]
机构
[1] Natl Univ Def Technol, Coll Elect Sci & Technol, Changsha 410073, Peoples R China
[2] Auckland Univ Technol, Dept Elect & Elect Engn, Auckland 1142, New Zealand
[3] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518060, Peoples R China
[4] Univ British Columbia, Dept Elect & Comp Engn, Vancouver, BC V6T 1Z4, Canada
基金
中国国家自然科学基金;
关键词
Cognitive radio networks; cooperative game; decentralized partially observable Markov decision process (Dec-POMDP); deep recurrent Q-network (DRQN); dynamic spectrum access (DSA); Markov game; multi-agent reinforcement learning (MARL); INTELLIGENT REFLECTING SURFACE; OPTIMALITY;
D O I
10.1109/JIOT.2022.3168296
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the development of wireless communication and Internet of Things (IoT), there are massive wireless devices that need to share the limited spectrum resources. Dynamic spectrum access (DSA) is a promising paradigm to remedy the problem of inefficient spectrum utilization brought upon by the historical command-and-control approach to spectrum allocation. In this article, we investigate the distributed DSA problem for multiusers in a typical multichannel cognitive radio network. The problem is formulated as a decentralized partially observable Markov decision process (Dec-POMDP), and we propose a centralized off-line training and distributed online execution framework based on cooperative multi-agent reinforcement learning (MARL). We employ the deep recurrent Q-network (DRQN) to address the partial observability of the state for each cognitive user. The ultimate goal is to learn a cooperative strategy which maximizes the sum throughput of a cognitive radio network in a distributed fashion without information exchange between cognitive users. Finally, we validate the proposed algorithm in various settings through extensive experiments. The experimental results show that the proposed CoMARL-DSA algorithm outperforms the state-of-the-art deep Q-learning for spectrum access (DQSA) in terms of successful access rate and collision rate by at least 14% and 12%, respectively.
引用
下载
收藏
页码:19477 / 19488
页数:12
相关论文
共 50 条
  • [21] Cooperative Cache in Cognitive Radio Networks: A Heterogeneous Multi-Agent Learning Approach
    Gao, Ang
    Liu, Hengtong
    Hu, Yansu
    Liang, Wei
    Ng, Soon Xin
    IEEE COMMUNICATIONS LETTERS, 2022, 26 (05) : 1032 - 1036
  • [22] DMADRL: A Distributed Multi-agent Deep Reinforcement Learning Algorithm for Cognitive Offloading in Dynamic MEC Networks
    Yi, Meng
    Yang, Peng
    Du, Miao
    Ma, Ruochen
    NEURAL PROCESSING LETTERS, 2022, 54 (05) : 4341 - 4373
  • [23] DMADRL: A Distributed Multi-agent Deep Reinforcement Learning Algorithm for Cognitive Offloading in Dynamic MEC Networks
    Meng Yi
    Peng Yang
    Miao Du
    Ruochen Ma
    Neural Processing Letters, 2022, 54 : 4341 - 4373
  • [24] Cooperative Reinforcement Learning Algorithm to Distributed Power System Based on Multi-Agent
    Gao, La-mei
    Zeng, Jun
    Wu, Jie
    Li, Min
    2009 3RD INTERNATIONAL CONFERENCE ON POWER ELECTRONICS SYSTEMS AND APPLICATIONS: ELECTRIC VEHICLE AND GREEN ENERGY, 2009, : 53 - 53
  • [25] Deep-Reinforcement-Learning-Based Distributed Dynamic Spectrum Access in Multiuser Multichannel Cognitive Radio Internet of Things Networks
    Zhang, Xiaohui
    Chen, Ze
    Zhang, Yinghui
    Liu, Yang
    Jin, Minglu
    Qiu, Tianshuang
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (10): : 17495 - 17509
  • [26] Multi-agent reinforcement learning based optimal energy sensing threshold control in distributed cognitive radio networks with directional antenna
    Pham, Thi Thu Hien
    Noh, Wonjong
    Cho, Sungrae
    ICT EXPRESS, 2024, 10 (03): : 472 - 478
  • [27] Dynamic spectrum access based on deep reinforcement learning for multiple access in cognitive radio
    Li, Zeng-qi
    Liu, Xin
    Ning, Zhao-long
    PHYSICAL COMMUNICATION, 2022, 54
  • [28] Spectrum Sharing in Vehicular Networks Based on Multi-Agent Reinforcement Learning
    Liang, Le
    Ye, Hao
    Li, Geoffrey Ye
    IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2019, 37 (10) : 2282 - 2292
  • [29] Reinforcement Learning for Opportunistic Spectrum Access in Cognitive Radio Networks
    Zhao, Fie
    Qu, Daiming
    Zhong, Guohui
    Cao, Yang
    2010 INTERNATIONAL CONFERENCE ON COMMUNICATION AND VEHICULAR TECHNOLOGY (ICCVT 2010), VOL I, 2010, : 116 - 120
  • [30] Scalable Multi-Agent Reinforcement Learning-Based Distributed Channel Access
    Chen, Zhenyu
    Sun, Xinghua
    ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2023, : 453 - 458