Cooperative Multi-Agent Reinforcement-Learning-Based Distributed Dynamic Spectrum Access in Cognitive Radio Networks

被引:27
|
作者
Tan, Xiang [1 ]
Zhou, Li [1 ]
Wang, Haijun [1 ]
Sun, Yuli [1 ]
Zhao, Haitao [1 ]
Seet, Boon-Chong [2 ]
Wei, Jibo [1 ]
Leung, Victor C. M. [3 ,4 ]
机构
[1] Natl Univ Def Technol, Coll Elect Sci & Technol, Changsha 410073, Peoples R China
[2] Auckland Univ Technol, Dept Elect & Elect Engn, Auckland 1142, New Zealand
[3] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518060, Peoples R China
[4] Univ British Columbia, Dept Elect & Comp Engn, Vancouver, BC V6T 1Z4, Canada
基金
中国国家自然科学基金;
关键词
Cognitive radio networks; cooperative game; decentralized partially observable Markov decision process (Dec-POMDP); deep recurrent Q-network (DRQN); dynamic spectrum access (DSA); Markov game; multi-agent reinforcement learning (MARL); INTELLIGENT REFLECTING SURFACE; OPTIMALITY;
D O I
10.1109/JIOT.2022.3168296
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the development of wireless communication and Internet of Things (IoT), there are massive wireless devices that need to share the limited spectrum resources. Dynamic spectrum access (DSA) is a promising paradigm to remedy the problem of inefficient spectrum utilization brought upon by the historical command-and-control approach to spectrum allocation. In this article, we investigate the distributed DSA problem for multiusers in a typical multichannel cognitive radio network. The problem is formulated as a decentralized partially observable Markov decision process (Dec-POMDP), and we propose a centralized off-line training and distributed online execution framework based on cooperative multi-agent reinforcement learning (MARL). We employ the deep recurrent Q-network (DRQN) to address the partial observability of the state for each cognitive user. The ultimate goal is to learn a cooperative strategy which maximizes the sum throughput of a cognitive radio network in a distributed fashion without information exchange between cognitive users. Finally, we validate the proposed algorithm in various settings through extensive experiments. The experimental results show that the proposed CoMARL-DSA algorithm outperforms the state-of-the-art deep Q-learning for spectrum access (DQSA) in terms of successful access rate and collision rate by at least 14% and 12%, respectively.
引用
收藏
页码:19477 / 19488
页数:12
相关论文
共 50 条
  • [1] A Cooperative Spectrum Sensing With Multi-Agent Reinforcement Learning Approach in Cognitive Radio Networks
    Gao, Ang
    Du, Chengyuan
    Ng, Soon Xin
    Liang, Wei
    [J]. IEEE COMMUNICATIONS LETTERS, 2021, 25 (08) : 2604 - 2608
  • [2] Reinforcement-Learning-Based Double Auction Design for Dynamic Spectrum Access in Cognitive Radio Networks
    Yinglei Teng
    F. Richard Yu
    Ke Han
    Yifei Wei
    Yong Zhang
    [J]. Wireless Personal Communications, 2013, 69 : 771 - 791
  • [3] Reinforcement-Learning-Based Double Auction Design for Dynamic Spectrum Access in Cognitive Radio Networks
    Teng, Yinglei
    Yu, F. Richard
    Han, Ke
    Wei, Yifei
    Zhang, Yong
    [J]. WIRELESS PERSONAL COMMUNICATIONS, 2013, 69 (02) : 771 - 791
  • [4] Multi-Agent Reinforcement Learning-Based Distributed Dynamic Spectrum Access
    Albinsaid, Hasan
    Singh, Keshav
    Biswas, Sudip
    Li, Chih-Peng
    [J]. IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING, 2022, 8 (02) : 1174 - 1185
  • [5] Attention Enhanced Multi-Agent Reinforcement Learning for Cooperative Spectrum Sensing in Cognitive Radio Networks
    Gao, Ang
    Wang, Qinyu
    Wang, Yongze
    Du, Chengyuan
    Hu, Yansu
    Liang, Wei
    Ng, Soon Xin
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (07) : 10464 - 10477
  • [6] Multi-Agent Reinforcement Learning for Joint Cooperative Spectrum Sensing and Channel Access in Cognitive UAV Networks
    Jiang, Weiheng
    Yu, Wanxin
    Wang, Wenbo
    Huang, Tiancong
    [J]. SENSORS, 2022, 22 (04)
  • [7] Multi-Agent Reinforcement Learning for Dynamic Spectrum Access
    Jiang, Huijuan
    Wang, Tianyu
    Wang, Shaowei
    [J]. ICC 2019 - 2019 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2019,
  • [8] Distributed cooperative spectrum sensing based on reinforcement learning in cognitive radio networks
    Zhang, Mengbo
    Wang, Lunwen
    Feng, Yanqing
    [J]. AEU-INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATIONS, 2018, 94 : 359 - 366
  • [9] Distributed Cooperative Spectrum Sharing in UAV Networks Using Multi-Agent Reinforcement Learning
    Shamsoshoara, Alireza
    Khaledi, Mehrdad
    Afghah, Fatemeh
    Razi, Abolfazl
    Ashdown, Jonathan
    [J]. 2019 16TH IEEE ANNUAL CONSUMER COMMUNICATIONS & NETWORKING CONFERENCE (CCNC), 2019,
  • [10] Multi-agent Q-learning of Spectrum Access in Distributed Cognitive Radio Network
    Min Neng
    Wu Qi-hui
    Xu Yu-hua
    Ding Guo-ru
    [J]. INTERNATIONAL CONFERENCE OF CHINA COMMUNICATION (ICCC2010), 2010, : 656 - 660