Multi-Agent Reinforcement Learning Based Energy Efficiency Optimization in NB-IoT Networks

被引:7
|
作者
Guo, Yuancheng [1 ]
Xiang, Min [1 ]
机构
[1] Imperial Coll London, Dept Elect & Elect Engn, London, England
关键词
NB-IoT; MARL; WoLF-PHC; power ramping; preamble allocation; energy efficiency;
D O I
10.1109/gcwkshps45667.2019.9024676
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Based on the existing Evolved Packet System (EPS) architecture, Narrowband Internet of Things (NB-IoT) has been expected as a promising paradigm to support energy-aware massive Machine Type Communications (mMTC). However, with the tremendous increase of IoT devices, as well as their requirements of energy-saving and low-cost, current power ramping and preamble allocation mechanisms in legacy long term evolution (LTE) can hardly achieve high energy efficiency in machine-to-machine (M2M) communications, mainly resulting from the significant redundancy of control signals. Due to the strict restrictions of NB-IoT, up till the present moment, the standardized preamble allocation mechanism is still randomly picking. To satisfy these constrained conditions in NB-IoT, this work proposes a joint optimization framework of power ramping and preamble picking to improve the energy efficiency of NB-IoT systems. In this optimization problem, a comprehensive energy estimation model is established, which investigates the inadequacy of random access (RA) procedure and meanwhile reveals the effects of power ramping and preamble picking on energy efficiency. In addition, to search the optimal policies of the joint optimization formulated. A distributed Multi-Agent Reinforcement Learning (MARL) algorithm based on Win-or-Learn-Fast Policy Hill-Climbing (WOLF-PHC) is proposed, in which a "stateless" modification is introduced to reduce the algorithm complexity significantly. The performance of high energy efficiency is validated in simulations, which also reveal the applicability and convergence of the designed WOLF-PHC based optimization algorithm.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Reinforcement Learning for Real-Time Optimization in NB-IoT Networks
    Jiang, Nan
    Deng, Yansha
    Nallanathan, Arumugam
    Chambers, Jonathon A.
    [J]. IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2019, 37 (06) : 1424 - 1440
  • [2] Multi-agent Reinforcement Learning for Green Energy Powered IoT Networks with Random Access
    Han, Mengqi
    Del Castillo, Luis Arocas
    Khairy, Sami
    Chen, Xuehan
    Cai, Lin X.
    Lin, Bin
    Hou, Fen
    [J]. 2020 IEEE 92ND VEHICULAR TECHNOLOGY CONFERENCE (VTC2020-FALL), 2020,
  • [3] Reinforcement-Learning-Based Optimization on Energy Efficiency in UAV Networks for IoT
    Deng, Dan
    Li, Junxia
    Jhaveri, Rutvij H. H.
    Tiwari, Prayag
    Ijaz, Muhammad Fazal
    Ou, Jiangtao
    Fan, Chengyuan
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (03) : 2767 - 2775
  • [4] COOPERATIVE DEEP REINFORCEMENT LEARNING FOR MULTIPLE-GROUP NB-IOT NETWORKS OPTIMIZATION
    Jiang, Nan
    Deng, Yansha
    Simeone, Osvaldo
    Nallanathan, Arumugam
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 8424 - 8428
  • [5] Visualizing Multi-Agent Reinforcement Learning for Robotic Communication in Industrial IoT Networks
    Luo, Ruyu
    Ni, Wanli
    Tian, Hui
    [J]. IEEE INFOCOM 2022 - IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS (INFOCOM WKSHPS), 2022,
  • [6] Multi-Agent Reinforcement Learning for Resource Allocation in IoT Networks with Edge Computing
    Liu, Xiaolan
    Yu, Jiadong
    Feng, Zhiyong
    Gao, Yue
    [J]. CHINA COMMUNICATIONS, 2020, 17 (09) : 220 - 236
  • [7] Access Control in NB-IoT Networks: A Deep Reinforcement Learning Strategy
    Hadjadj-Aoul, Yassine
    Ait-Chellouche, Soraya
    [J]. INFORMATION, 2020, 11 (11) : 1 - 16
  • [8] Deep Reinforcement Learning for NPDCCH Period Adjustment in NB-IoT Networks
    Yu, Ya-Ju
    Chuang, Ching-Chih
    Cheng, Yu-Wei
    [J]. 2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 1883 - 1888
  • [9] Distributed localization for IoT with multi-agent reinforcement learning
    Jia, Jie
    Yu, Ruoying
    Du, Zhenjun
    Chen, Jian
    Wang, Qinghu
    Wang, Xingwei
    [J]. NEURAL COMPUTING & APPLICATIONS, 2022, 34 (09): : 7227 - 7240
  • [10] Distributed localization for IoT with multi-agent reinforcement learning
    Jie Jia
    Ruoying Yu
    Zhenjun Du
    Jian Chen
    Qinghu Wang
    Xingwei Wang
    [J]. Neural Computing and Applications, 2022, 34 : 7227 - 7240