Deep Reinforcement Learning for Scheduling Uplink IoT Traffic with Strict Deadlines

被引:2
|
作者
Robaglia, Benoit-Marie [1 ,2 ]
Destounis, Apostolos [1 ]
Coupechoux, Marceau [2 ]
Tsilimantos, Dimitrios [1 ]
机构
[1] Huawei Technol Co Ltd, Math & Algorithm Sci Lab, Paris Res Ctr, Shenzhen, Guangdong, Peoples R China
[2] Inst Polytech Paris, Telecom Paris, LTCI, Palaiseau, France
关键词
Multiple Access; Reinforcement Learning; Proximal Policy Optimization; POMDP; Internet of Things; Wireless sensor networks; scheduling; SPECTRUM ACCESS;
D O I
10.1109/GLOBECOM46510.2021.9685561
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper considers the Multiple Access problem where N Internet of Things (IoT) devices share a common wireless medium towards a central Base Station (BS). We propose a Reinforcement Learning (RL) method where the BS is the agent and the devices are part of the environment. A device is allowed to transmit only when the BS decides to schedule it. Besides the information packets, devices send additional messages like the delay or the number of discarded packets since their last transmission. This information is used to design the RL reward function and constitutes the next observation that the agent can use to schedule the next device. Leveraging RL allows us to learn the sporadic and heterogeneous traffic patterns of the IoT devices and an optimal scheduling policy that maximizes the channel throughput. We adapt the Proximal Policy Optimization (PPO) algorithm with a Recurrent Neural Network (RNN) to handle the partial observability of our problem and exploit the temporal correlations of the users' traffic. We demonstrate the performance of our model through simulations on different number of heterogeneous devices with periodic traffic and individual latency constraints. We show that our RL algorithm outperforms traditional scheduling schemes and distributed medium access algorithms.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] SeqDQN: Multi-Agent Deep Reinforcement Learning for Uplink URLLC with Strict Deadlines
    Robaglia, Benoit Marie
    Coupechoux, Marceau
    Tsilimantos, Dimitrios
    Destounis, Apostolos
    [J]. 2023 JOINT EUROPEAN CONFERENCE ON NETWORKS AND COMMUNICATIONS & 6G SUMMIT, EUCNC/6G SUMMIT, 2023, : 623 - 628
  • [2] Cellular Network Traffic Scheduling with Deep Reinforcement Learning
    Chinchali, Sandeep
    Hu, Pan
    Chu, Tianshu
    Sharma, Manu
    Bansal, Manu
    Misra, Rakesh
    Pavone, Marco
    Katti, Sachin
    [J]. THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 766 - 774
  • [3] OPTIMAL SCHEDULING WITH STRICT DEADLINES
    BHATTACHARYA, PP
    EPHREMIDES, A
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1989, 34 (07) : 721 - 728
  • [4] Optimal scheduling with strict deadlines
    [J]. Bhattacharya, Partha P., 1600, (34):
  • [5] Deep Reinforcement Learning for Uplink Scheduling in NOMA-URLLC Networks
    Robaglia, Benoît-Marie
    Coupechoux, Marceau
    Tsilimantos, Dimitrios
    [J]. IEEE Transactions on Machine Learning in Communications and Networking, 2024, 2 : 1142 - 1158
  • [6] Iot Data Processing and Scheduling Based on Deep Reinforcement Learning
    Jiang, Yuchuan
    Wang, Zhangjun
    Jin, Zhixiong
    [J]. INTERNATIONAL JOURNAL OF COMPUTERS COMMUNICATIONS & CONTROL, 2023, 18 (06)
  • [7] Service Centric Scheduling With Strict Deadlines
    Ramirez, David
    Aazhang, Behnaam
    [J]. 2015 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2015,
  • [8] Traffic scheduling, network slicing and virtualization based on deep reinforcement learning
    Kumar, Priyan Malarvizhi
    Basheer, Shakila
    Rawal, Bharat S.
    Afghah, Fatemeh
    Babu, Gokulnath Chandra
    Arunmozhi, Manimuthu
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2022, 100
  • [9] Delay-aware Cellular Traffic Scheduling with Deep Reinforcement Learning
    Zhang, Ticao
    Shen, Shuyi
    Mao, Shiwen
    Chang, Gee-Kung
    [J]. 2020 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2020,
  • [10] AoI-Aware Resource Scheduling for Industrial IoT with Deep Reinforcement Learning
    Li, Hongzhi
    Tang, Lin
    Chen, Shengwei
    Zheng, Libin
    Zhong, Shaohong
    [J]. ELECTRONICS, 2024, 13 (06)