Deep Reinforcement Learning for Uplink Scheduling in NOMA-URLLC Networks

被引:0
|
作者
Robaglia, Benoît-Marie [1 ]
Coupechoux, Marceau [1 ]
Tsilimantos, Dimitrios [2 ]
机构
[1] Institut Polytechnique de Paris, LTCI, Télécom Paris, Palaiseau,91764, France
[2] Huawei Technologies Company Ltd., Advanced Wireless Technology Laboratory, Paris Research Center, Boulogne-Billancourt,92100, France
关键词
Deep learning - Learning algorithms - Markov processes - Mathematical transformations - Mobile telecommunication systems - Reinforcement learning - Scheduling algorithms;
D O I
10.1109/TMLCN.2024.3437351
中图分类号
学科分类号
摘要
This article addresses the problem of Ultra Reliable Low Latency Communications (URLLC) in wireless networks, a framework with particularly stringent constraints imposed by many Internet of Things (IoT) applications from diverse sectors. We propose a novel Deep Reinforcement Learning (DRL) scheduling algorithm, named NOMA-PPO, to solve the Non-Orthogonal Multiple Access (NOMA) uplink URLLC scheduling problem involving strict deadlines. The challenge of addressing uplink URLLC requirements in NOMA systems is related to the combinatorial complexity of the action space due to the possibility to schedule multiple devices, and to the partial observability constraint that we impose to our algorithm in order to meet the IoT communication constraints and be scalable. Our approach involves 1) formulating the NOMA-URLLC problem as a Partially Observable Markov Decision Process (POMDP) and the introduction of an agent state, serving as a sufficient statistic of past observations and actions, enabling a transformation of the POMDP into a Markov Decision Process (MDP); 2) adapting the Proximal Policy Optimization (PPO) algorithm to handle the combinatorial action space; 3) incorporating prior knowledge into the learning agent with the introduction of a Bayesian policy. Numerical results reveal that not only does our approach outperform traditional multiple access protocols and DRL benchmarks on 3GPP scenarios, but also proves to be robust under various channel and traffic configurations, efficiently exploiting inherent time correlations. © 2023 CCBY.
引用
收藏
页码:1142 / 1158
相关论文
共 50 条
  • [1] A Reliable Reinforcement Learning for Resource Allocation in Uplink NOMA-URLLC Networks
    Ahsan, Waleed
    Yi, Wenqiang
    Liu, Yuanwei
    Nallanathan, Arumugam
    [J]. IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2022, 21 (08) : 5989 - 6002
  • [2] A Multi-Agent Reinforcement Learning Approach for Massive Access in NOMA-URLLC Networks
    Han, Huimei
    Jiang, Xin
    Lu, Weidang
    Zhai, Wenchao
    Li, Ying
    Kumar, Neeraj
    Guizani, Mohsen
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (12) : 16799 - 16804
  • [3] On using Deep Reinforcement Learning to reduce Uplink Latency for uRLLC services
    Boutiba, Karim
    Bagaa, Miloud
    Ksentini, Adlen
    [J]. 2022 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM 2022), 2022, : 407 - 412
  • [4] Reliable Reinforcement Learning Based NOMA Schemes for URLLC
    Ahsan, Waleed
    Yi, Wenqiang
    Liu, Yuanwei
    Nallanathan, Arumugam
    [J]. 2021 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2021,
  • [5] SeqDQN: Multi-Agent Deep Reinforcement Learning for Uplink URLLC with Strict Deadlines
    Robaglia, Benoit Marie
    Coupechoux, Marceau
    Tsilimantos, Dimitrios
    Destounis, Apostolos
    [J]. 2023 JOINT EUROPEAN CONFERENCE ON NETWORKS AND COMMUNICATIONS & 6G SUMMIT, EUCNC/6G SUMMIT, 2023, : 623 - 628
  • [6] Deep Reinforcement Learning-Based Joint Scheduling of eMBB and URLLC in 5G Networks
    Li, Jing
    Zhang, Xing
    [J]. IEEE WIRELESS COMMUNICATIONS LETTERS, 2020, 9 (09) : 1543 - 1546
  • [7] Task Scheduling and Power Allocation in Multiuser Multiserver Vehicular Networks by NOMA and Deep Reinforcement Learning
    Cong, Yuliang
    Liu, Maiou
    Wang, Cong
    Sun, Shuxian
    Hu, Fengye
    Liu, Zhan
    Wang, Chaoying
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (13): : 23532 - 23543
  • [8] Dual Dynamic Scheduling for Hierarchical QoS in Uplink-NOMA: A Reinforcement Learning Approach
    Li, Xiangjun
    Cui, Qimei
    Zhai, Jinli
    Huang, Xueqing
    [J]. SENSORS, 2021, 21 (13)
  • [9] Coexistence Management for URLLC in Campus Networks via Deep Reinforcement Learning
    Khodapanah, Behnam
    Hoessler, Tom
    Yuncu, Baris
    Barreto, Andre Noll
    Simsek, Meryem
    Fettweis, Gerhard
    [J]. 2020 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2020,
  • [10] Deep Reinforcement Learning for Scheduling in Cellular Networks
    Wang, Jian
    Xu, Chen
    Huangfu, Yourui
    Li, Rong
    Ge, Yiqun
    Wang, Jun
    [J]. 2019 11TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP), 2019,