Dynamic Spectrum Access for D2D-Enabled Internet of Things: A Deep Reinforcement Learning Approach

被引:11
|
作者
Huang, Jingfei [1 ,2 ]
Yang, Yang [1 ,2 ]
Gao, Zhen [3 ,4 ]
He, Dazhong [1 ,2 ]
Ng, Derrick Wing Kwan [5 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Artificial Intelligence, Beijing 100876, Peoples R China
[2] Beijing Univ Posts & Telecommun, Ctr Data Sci, Beijing 100876, Peoples R China
[3] Southeast Univ, Natl Mobile Commun Res Lab, Nanjing 211189, Jiangsu, Peoples R China
[4] Beijing Inst Technol, Adv Res Inst Multidisciplinary Sci, Beijing 100081, Peoples R China
[5] Univ New South Wales, Sch Elect Engn & Telecommun, Sydney, NSW 2025, Australia
基金
北京市自然科学基金; 中国国家自然科学基金;
关键词
Device-to-device (D2D) communication; deep reinforcement learning (DRL); dynamic spectrum access; Internet of Things (IoT); RESOURCE-ALLOCATION; COMMUNICATION; SELECTION; NETWORKS;
D O I
10.1109/JIOT.2022.3160197
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Device-to-device (D2D) communication is regarded as a promising technology to support spectral-efficient Internet of Things (IoT) in beyond fifth-generation (5G) and sixth-generation (6G) networks. This article investigates the spectrum access problem for D2D-assisted cellular networks based on deep reinforcement learning (DRL), which can be applied to both the uplink and downlink scenarios. Specifically, we consider a time-slotted cellular network, where D2D nodes share the cellular spectrum resources (CUEs) with cellular users in a time-splitting manner. Besides, D2D nodes could reuse time slots preoccupied by CUEs according to a location-based spectrum access (LSA) strategy on the premise of cellular communication quality. The key challenge lies in that D2D nodes have no information on the LSA strategy and the access principle of CUEs. Thus, we design a DRL-based spectrum access scheme such that the D2D nodes can autonomously acquire an optimal strategy for efficient spectrum access without any prior knowledge to achieve a specific objective such as maximizing the normalized sum throughput. Moreover, we adopt a generalized double deep Q-network (DDQN) algorithm and extend the objective function to explore the resource allocation fairness for D2D nodes. The proposed scheme is evaluated under various conditions and our simulation results show that it can achieve the near-optimal throughput performance with different objectives compared to the benchmark, which is the theoretical throughput upper bound derived from a genius-aided scheme with complete system knowledge available.
引用
下载
收藏
页码:17793 / 17807
页数:15
相关论文
共 50 条
  • [21] Federated Reinforcement Learning-Based Resource Allocation in D2D-Enabled 6G
    Guo, Qi
    Tang, Fengxiao
    Kato, Nei
    IEEE NETWORK, 2023, 37 (05): : 89 - 95
  • [22] Resource Allocation in D2D-Enabled 5G Networks Using Multiagent Reinforcement Learning
    Agyekum, Kwame Opuni-Boachie Obour
    Boakye, Alex Yaw
    Appati, Benedict
    Opoku, Jochebed Akoto
    Agyemang, Justice Owusu
    Boateng, Gordon Owusu
    Gadze, James Dzisi
    JOURNAL OF COMPUTER NETWORKS AND COMMUNICATIONS, 2024, 2024
  • [23] Energy Efficiency and Spectrum Efficiency Tradeoff in the D2D-Enabled HetNet
    Gao, Hui
    Wang, Min
    Lv, Tiejun
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2017, 66 (11) : 10583 - 10587
  • [24] Spectrum Sharing and Scheduling in D2D-Enabled Dense Cellular Networks
    Krishnasamy, Subhashini
    Shakkottai, Sanjay
    2015 13TH INTERNATIONAL SYMPOSIUM ON MODELING AND OPTIMIZATION IN MOBILE, AD HOC, AND WIRELESS NETWORKS (WIOPT), 2015, : 307 - 314
  • [25] Multichannel spectrum access based on reinforcement learning in cognitive internet of things
    Sun, Can
    Ding, Hua
    Liu, Xin
    AD HOC NETWORKS, 2020, 106 (106)
  • [26] Deep-Reinforcement-Learning-Based Distributed Dynamic Spectrum Access in Multiuser Multichannel Cognitive Radio Internet of Things Networks
    Zhang, Xiaohui
    Chen, Ze
    Zhang, Yinghui
    Liu, Yang
    Jin, Minglu
    Qiu, Tianshuang
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (10): : 17495 - 17509
  • [27] A New Centralized Access Control Scheme for D2D-Enabled mmWave Networks
    Panno, Daniela
    Riolo, Salvatore
    IEEE ACCESS, 2019, 7 : 80697 - 80716
  • [28] Opportunistic access control for enhancing security in D2D-enabled cellular networks
    Chen, Yajun
    Ji, Xinsheng
    Huang, Kaizhi
    Li, Bin
    Kang, Xiaolei
    SCIENCE CHINA-INFORMATION SCIENCES, 2018, 61 (04)
  • [29] Opportunistic access control for enhancing security in D2D-enabled cellular networks
    Yajun Chen
    Xinsheng Ji
    Kaizhi Huang
    Bin Li
    Xiaolei Kang
    Science China Information Sciences, 2018, 61
  • [30] A Deep Learning based Resource Allocation Algorithm for Variable Dimensions in D2D-Enabled Cellular Networks
    Pei, Errong
    Yang, Guangcai
    2020 IEEE/CIC INTERNATIONAL CONFERENCE ON COMMUNICATIONS IN CHINA (ICCC), 2020, : 277 - 282