Deep Reinforcement Learning Based Dynamic Channel Allocation Algorithm in Multibeam Satellite Systems

被引:86
|
作者
Liu, Shuaijun
Hu, Xin [1 ]
Wang, Weidong
机构
[1] Beijing Univ Posts & Telecommun, Minist Educ, Key Lab Universal Wireless Commun, Beijing 100876, Peoples R China
来源
IEEE ACCESS | 2018年 / 6卷
基金
中国国家自然科学基金;
关键词
Dynamic channel allocation (DCA); multibeam satellite systems; Markov decision process (MDP); deep reinforcement learning (DRL); blocking probability; GAME;
D O I
10.1109/ACCESS.2018.2809581
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Dynamic channel allocation (DCA) is the key technology to efficiently utilize the spectrum resources and decrease the co-channel interference for multibeam satellite systems. Most works allocate the channel on the basis of the beam traffic load or the user terminal distribution of the current moment. These greedy-like algorithms neglect the intrinsic temporal correlation among the sequential channel allocation decisions, resulting in the spectrum resources underutilization. To solve this problem, a novel deep reinforcement learning (DRL)-based DCA (DRL-DCA) algorithm is proposed. Specifically, the DCA optimization problem, which aims at minimizing the service blocking probability, is formulated in the multibeam satellite systems. Due to the temporal correlation property, the DCA optimization problem is modeled as the Markov decision process (MDP) which is the dominant analytical approach in DRL. In modeled MDP, the system state is reformulated into an image-like fashion, and then, convolutional neural network is used to extract useful features. Simulation results show that the DRL-DCA algorithm can decrease the blocking probability and improve the carried traffic and spectrum efficiency compared with other channel allocation algorithms.
引用
收藏
页码:15733 / 15742
页数:10
相关论文
共 50 条
  • [1] An online power allocation algorithm based on deep reinforcement learning in multibeam satellite systems
    Zhang, Pei
    Wang, Xiaohui
    Ma, Zhiguo
    Liu, Shuaijun
    Song, Junde
    [J]. INTERNATIONAL JOURNAL OF SATELLITE COMMUNICATIONS AND NETWORKING, 2020, 38 (05) : 450 - 461
  • [2] A Deep Reinforcement Learning-Based Framework for Dynamic Resource Allocation in Multibeam Satellite Systems
    Hu, Xin
    Liu, Shuaijun
    Chen, Rong
    Wang, Weidong
    Wang, Chunting
    [J]. IEEE COMMUNICATIONS LETTERS, 2018, 22 (08) : 1612 - 1615
  • [3] Dynamic Resource Allocation With Deep Reinforcement Learning in Multibeam Satellite Communication
    Deng, Danhao
    Wang, Chaowei
    Pang, Mingliang
    Wang, Weidong
    [J]. IEEE WIRELESS COMMUNICATIONS LETTERS, 2023, 12 (01) : 75 - 79
  • [4] Deep reinforcement learning-based beam Hopping algorithm in multibeam satellite systems
    Hu, Xin
    Liu, Shuaijun
    Wang, Yipeng
    Xu, Lexi
    Zhang, Yuchen
    Wang, Cheng
    Wang, Weidong
    [J]. IET COMMUNICATIONS, 2019, 13 (16) : 2485 - 2491
  • [5] Dynamic Channel Allocation for Satellite Internet of Things via Deep Reinforcement Learning
    Liu, Jiahao
    Zhao, Baokang
    Xin, Qin
    Liu, Hua
    [J]. 2020 34TH INTERNATIONAL CONFERENCE ON INFORMATION NETWORKING (ICOIN 2020), 2020, : 465 - 470
  • [6] A novel deep reinforcement learning architecture for dynamic power and bandwidth allocation in multibeam satellites
    Xu, Jing
    Zhao, Zhongtian
    Wang, Lei
    Zhang, Yizhai
    [J]. ACTA ASTRONAUTICA, 2023, 204 : 73 - 82
  • [7] LEO Satellite Channel Allocation Scheme Based on Reinforcement Learning
    Zheng, Fei
    Pi, Zhao
    Zhou, Zou
    Wang, Kaixuan
    [J]. MOBILE INFORMATION SYSTEMS, 2020, 2020
  • [8] Reinforcement learning for dynamic channel allocation in cellular telephone systems
    Singh, S
    Bertsekas, D
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 9: PROCEEDINGS OF THE 1996 CONFERENCE, 1997, 9 : 974 - 980
  • [9] Reinforcement Learning for Dynamic Channel Allocation in Mobile Cellular Systems
    Ranjan, Rajeev
    Phophalia, Anukriti
    [J]. INTERNATIONAL CONFERENCE ON RECENT ADVANCES IN MICROWAVE THEORY AND APPLICATIONS, PROCEEDINGS, 2008, : 924 - 927
  • [10] BeiDou Short-Message Satellite Resource Allocation Algorithm Based on Deep Reinforcement Learning
    Xia, Kaiwen
    Feng, Jing
    Yan, Chao
    Duan, Chaofan
    [J]. ENTROPY, 2021, 23 (08)