Collaborative Computing in Non-Terrestrial Networks: A Multi-Time-Scale Deep Reinforcement Learning Approach

被引:0
|
作者
Cao, Yang [1 ]
Lien, Shao-Yu [2 ]
Liang, Ying-Chang [3 ]
Niyato, Dusit [4 ]
Shen, Xuemin [5 ]
机构
[1] Southwest Jiaotong Univ, Sch Informat Sci & Technol, Chengdu 611756, Peoples R China
[2] Natl Yang Ming Chiao Tung Univ, Inst Intelligent Syst, Tainan 711, Taiwan
[3] Univ Elect Sci & Technol China, Ctr Intelligent Networking & Commun CINC, Chengdu 611731, Peoples R China
[4] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore 639798, Singapore
[5] Univ Waterloo, Dept Elect & Comp Engn, Waterloo, ON, Canada
关键词
Low earth orbit satellites; Satellite broadcasting; Satellites; Optimization; Convergence; Resource management; 3GPP; Non-terrestrial networks (NTNs); earth-fixed cell; beam management; resource allocation; deep reinforcement learning (DRL); multi-time-scale Markov decision process (MMDPs);
D O I
10.1109/TWC.2023.3323554
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Constructing earth-fixed cells with low-earth orbit (LEO) satellites in non-terrestrial networks (NTNs) has been the most promising paradigm to enable global coverage. The limited computing capabilities on LEO satellites however render tackling resource optimization within a short duration a critical challenge. Although the sufficient computing capabilities of the ground infrastructures can be utilized to assist the LEO satellite, different time-scale control cycles and coupling decisions between the space- and ground-segments still obstruct the joint optimization design for computing agents at different segments. To address the above challenges, in this paper, a multi-time-scale deep reinforcement learning (DRL) scheme is developed for achieving the radio resource optimization in NTNs, in which the LEO satellite and user equipment (UE) collaborate with each other to perform individual decision-making tasks with different control cycles. Specifically, the UE updates its policy toward improving value functions of both the satellite and UE, while the LEO satellite only performs finite-step rollout for decision-makings based on the reference decision trajectory provided by the UE. Most importantly, rigorous analysis to guarantee the performance convergence of the proposed scheme is provided. Comprehensive simulations are conducted to justify the effectiveness of the proposed scheme in balancing the transmission performance and computational complexity.
引用
收藏
页码:4932 / 4949
页数:18
相关论文
共 50 条
  • [1] Collaborative Deep Reinforcement Learning for Resource Optimization in Non-Terrestrial Networks
    Cao, Yang
    Lien, Shao-Yu
    Liang, Ying-Chang
    Niyato, Dusit
    Shen, Xuemin
    [J]. 2023 IEEE 34TH ANNUAL INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR AND MOBILE RADIO COMMUNICATIONS, PIMRC, 2023,
  • [2] Multi-Tier Deep Reinforcement Learning for Non-Terrestrial Networks
    Cao, Yang
    Lien, Shao-Yu
    Liang, Ying-Chang
    Niyato, Dusit
    [J]. IEEE WIRELESS COMMUNICATIONS, 2024, 31 (03) : 194 - 201
  • [3] Autonomous Non-Terrestrial Base Station Deployment for Non-Terrestrial Networks: A Reinforcement Learning Approach
    Lien, Shao-Yu
    Deng, Der-Jiunn
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2022, 71 (10) : 10894 - 10909
  • [4] Deep Reinforcement Learning For Multi-User Access Control in Non-Terrestrial Networks
    Cao, Yang
    Lien, Shao-Yu
    Liang, Ying-Chang
    [J]. IEEE TRANSACTIONS ON COMMUNICATIONS, 2021, 69 (03) : 1605 - 1619
  • [5] Multi-tier Collaborative Deep Reinforcement Learning for Non-terrestrial Network Empowered Vehicular Connections
    Cao, Yang
    Lien, Shao-Yu
    Liang, Ying-Chang
    [J]. 2021 IEEE 29TH INTERNATIONAL CONFERENCE ON NETWORK PROTOCOLS (ICNP 2021), 2021,
  • [6] Multi-agent deep reinforcement learning for user association and resource allocation in integrated terrestrial and non-terrestrial networks
    Birabwa, Denise Joanitah
    Ramotsoela, Daniel
    Ventura, Neco
    [J]. COMPUTER NETWORKS, 2023, 231
  • [7] Collaborative Computing in Vehicular Networks: A Deep Reinforcement Learning Approach
    Li, Mushu
    Gao, Jie
    Zhang, Ning
    Zhao, Lian
    Shen, Xuemin
    [J]. ICC 2020 - 2020 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2020,
  • [8] Multi-Agent Deep Reinforcement Learning for Interference-Aware Channel Allocation in Non-Terrestrial Networks
    Cho, Yeongi
    Yang, Wooyeol
    Oh, Daesub
    Jo, Han-Shin
    [J]. IEEE COMMUNICATIONS LETTERS, 2023, 27 (03) : 936 - 940
  • [9] Traffic Scheduling in Non-Stationary Multipath Non-Terrestrial Networks: A Reinforcement Learning Approach
    Machumilane, Achilles
    Gotta, Alberto
    Cassara, Pietro
    Gennaro, Claudio
    Amato, Giuseppe
    [J]. ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2023, : 4094 - 4099
  • [10] Hierarchical Reinforcement Learning for Multi-Layer Multi-Service Non-Terrestrial Vehicular Edge Computing
    Shinde, Swapnil Sadashiv
    Tarchi, Daniele
    [J]. IEEE Transactions on Machine Learning in Communications and Networking, 2024, 2 : 1045 - 1061