Dynamic Spectrum Access for D2D-Enabled Internet of Things: A Deep Reinforcement Learning Approach

被引:11
|
作者
Huang, Jingfei [1 ,2 ]
Yang, Yang [1 ,2 ]
Gao, Zhen [3 ,4 ]
He, Dazhong [1 ,2 ]
Ng, Derrick Wing Kwan [5 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Artificial Intelligence, Beijing 100876, Peoples R China
[2] Beijing Univ Posts & Telecommun, Ctr Data Sci, Beijing 100876, Peoples R China
[3] Southeast Univ, Natl Mobile Commun Res Lab, Nanjing 211189, Jiangsu, Peoples R China
[4] Beijing Inst Technol, Adv Res Inst Multidisciplinary Sci, Beijing 100081, Peoples R China
[5] Univ New South Wales, Sch Elect Engn & Telecommun, Sydney, NSW 2025, Australia
基金
北京市自然科学基金; 中国国家自然科学基金;
关键词
Device-to-device (D2D) communication; deep reinforcement learning (DRL); dynamic spectrum access; Internet of Things (IoT); RESOURCE-ALLOCATION; COMMUNICATION; SELECTION; NETWORKS;
D O I
10.1109/JIOT.2022.3160197
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Device-to-device (D2D) communication is regarded as a promising technology to support spectral-efficient Internet of Things (IoT) in beyond fifth-generation (5G) and sixth-generation (6G) networks. This article investigates the spectrum access problem for D2D-assisted cellular networks based on deep reinforcement learning (DRL), which can be applied to both the uplink and downlink scenarios. Specifically, we consider a time-slotted cellular network, where D2D nodes share the cellular spectrum resources (CUEs) with cellular users in a time-splitting manner. Besides, D2D nodes could reuse time slots preoccupied by CUEs according to a location-based spectrum access (LSA) strategy on the premise of cellular communication quality. The key challenge lies in that D2D nodes have no information on the LSA strategy and the access principle of CUEs. Thus, we design a DRL-based spectrum access scheme such that the D2D nodes can autonomously acquire an optimal strategy for efficient spectrum access without any prior knowledge to achieve a specific objective such as maximizing the normalized sum throughput. Moreover, we adopt a generalized double deep Q-network (DDQN) algorithm and extend the objective function to explore the resource allocation fairness for D2D nodes. The proposed scheme is evaluated under various conditions and our simulation results show that it can achieve the near-optimal throughput performance with different objectives compared to the benchmark, which is the theoretical throughput upper bound derived from a genius-aided scheme with complete system knowledge available.
引用
下载
收藏
页码:17793 / 17807
页数:15
相关论文
共 50 条
  • [1] Dynamic Downlink Spectrum Access for D2D-Enabled Heterogeneous Networks
    Radaydeh, Redha M.
    A-Qahtani, Fawaz S.
    Celik, Abdulkadir
    Alouini, Mohamed-Slim
    GLOBECOM 2017 - 2017 IEEE GLOBAL COMMUNICATIONS CONFERENCE, 2017,
  • [2] Energy Minimization in D2D-Assisted Cache-Enabled Internet of Things: A Deep Reinforcement Learning Approach
    Tang, Jie
    Tang, Hengbin
    Zhang, Xiuyin
    Cumanan, Kanapathippillai
    Chen, Gaojie
    Wong, Kai-Kit
    Chambers, Jonathon
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2020, 16 (08) : 5412 - 5423
  • [3] A Reinforcement Learning Approach to Dynamic Spectrum Access in Internet-of-Things Networks
    Cha, Han
    Kim, Seong-Lyun
    ICC 2019 - 2019 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2019,
  • [4] A Reinforcement Learning-Based Approach to Graph Discovery in D2D-Enabled Federated Learning
    Wagle, Satyavrat
    Das, Anindya Bijoy
    Love, David J.
    Brinton, Christopher G.
    IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 225 - 230
  • [5] Dynamic Spectrum Access for Internet-of-Things Based on Federated Deep Reinforcement Learning
    Li, Feng
    Shen, Bowen
    Guo, Jiale
    Lam, Kwok-Yan
    Wei, Guiyi
    Wang, Li
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2022, 71 (07) : 7952 - 7956
  • [6] Dynamic spectrum access for Internet-of-Things with hierarchical federated deep reinforcement learning
    Zhang, Songbo
    Lam, Kwok-Yan
    Shen, Bowen
    Wang, Li
    Li, Feng
    AD HOC NETWORKS, 2023, 149
  • [7] Dynamic Resource Allocation with Integrated Reinforcement Learning for a D2D-Enabled LTE-A Network with Access to Unlicensed Band
    Asheralieva, Alia
    Miyanaga, Yoshikazu
    MOBILE INFORMATION SYSTEMS, 2016, 2016
  • [8] Internet of Reliable Things: Toward D2D-enabled NB-IoT
    Malarski, Krzysztof Mateusz
    Moradi, Farnaz
    Ballal, Kalpit Dilip
    Dittmann, Lars
    Ruepp, Sarah
    2020 FIFTH INTERNATIONAL CONFERENCE ON FOG AND MOBILE EDGE COMPUTING (FMEC), 2020, : 196 - 201
  • [9] Resource Allocation in Information-Centric Wireless Networking With D2D-Enabled MEC: A Deep Reinforcement Learning Approach
    Wang, Dan
    Qin, Hao
    Song, Bin
    Du, Xiaojiang
    Guizani, Mohsen
    IEEE ACCESS, 2019, 7 : 114935 - 114944
  • [10] Dynamic multiple access based on deep reinforcement learning for Internet of Things
    Liu, Xin
    Li, Zengqi
    COMPUTER COMMUNICATIONS, 2023, 210 : 331 - 341