Multi-Agent Reinforcement Learning Based Energy Efficiency Optimization in NB-IoT Networks

Cited by: 7

Authors:

Guo, Yuancheng [1]

Xiang, Min [1]

Affiliations:

[1] Imperial Coll London, Dept Elect & Elect Engn, London, England

Source:

2019 IEEE GLOBECOM WORKSHOPS (GC WKSHPS) | 2019

Keywords:

NB-IoT; MARL; WoLF-PHC; power ramping; preamble allocation; energy efficiency;

DOI:

10.1109/gcwkshps45667.2019.9024676

Chinese Library Classification (CLC) number:

TP39 [Computer applications];

Subject classification code:

081203 ; 0835 ;

Abstract:

Built on the existing Evolved Packet System (EPS) architecture, Narrowband Internet of Things (NB-IoT) is expected to be a promising paradigm for supporting energy-aware massive Machine Type Communications (mMTC). However, given the tremendous increase in the number of IoT devices and their energy-saving and low-cost requirements, the power ramping and preamble allocation mechanisms of legacy Long Term Evolution (LTE) can hardly achieve high energy efficiency in machine-to-machine (M2M) communications, mainly because of the significant redundancy of control signals. Owing to the strict restrictions of NB-IoT, the standardized preamble allocation mechanism still picks preambles at random. To satisfy these constraints, this work proposes a joint optimization framework for power ramping and preamble picking to improve the energy efficiency of NB-IoT systems. For this optimization problem, a comprehensive energy estimation model is established, which investigates the inadequacy of the random access (RA) procedure and reveals the effects of power ramping and preamble picking on energy efficiency. In addition, to search for the optimal policies of the formulated joint optimization problem, a distributed Multi-Agent Reinforcement Learning (MARL) algorithm based on Win-or-Learn-Fast Policy Hill-Climbing (WoLF-PHC) is proposed, in which a "stateless" modification is introduced to significantly reduce the algorithm complexity. Simulations validate the high energy efficiency achieved and also demonstrate the applicability and convergence of the designed WoLF-PHC based optimization algorithm.
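
The abstract names a "stateless" WoLF-PHC learner but gives no update rules, so the following is only a minimal sketch of how such an agent might look, not the authors' algorithm. It takes the standard WoLF-PHC structure (a Q-value estimate, a policy, an average policy, and a small "winning" step versus a larger "losing" step) and assumes that "stateless" means every table is indexed by action only, with no next-state bootstrap term in the Q update. The class name, the parameter names and defaults, and the idea that an action index stands for one (power ramping step, preamble) pair are all hypothetical.

```python
import random


class StatelessWoLFPHC:
    """Hypothetical stateless WoLF-PHC agent: tables are indexed by action only."""

    def __init__(self, n_actions, alpha=0.1, delta_win=0.01, delta_lose=0.04, seed=None):
        if n_actions < 2:
            raise ValueError("need at least two actions")
        self.n = n_actions
        self.alpha = alpha            # Q-value learning rate (assumed value)
        self.delta_win = delta_win    # small policy step when "winning"
        self.delta_lose = delta_lose  # larger policy step when "losing"
        self.q = [0.0] * n_actions
        self.pi = [1.0 / n_actions] * n_actions      # current mixed policy
        self.pi_avg = [1.0 / n_actions] * n_actions  # running average policy
        self.count = 0
        self.rng = random.Random(seed)

    def act(self):
        """Sample an action index from the current mixed policy."""
        return self.rng.choices(range(self.n), weights=self.pi)[0]

    def update(self, action, reward):
        """Apply one learning step after observing the reward for `action`."""
        # Assumption: with no state there is no bootstrapped next-state term.
        self.q[action] += self.alpha * (reward - self.q[action])

        # Incrementally track the average of the policies used so far.
        self.count += 1
        for a in range(self.n):
            self.pi_avg[a] += (self.pi[a] - self.pi_avg[a]) / self.count

        # "Win" if the current policy has a higher expected value than the average one.
        v_pi = sum(p * q for p, q in zip(self.pi, self.q))
        v_avg = sum(p * q for p, q in zip(self.pi_avg, self.q))
        delta = self.delta_win if v_pi > v_avg else self.delta_lose

        # Hill-climb: shift probability mass from non-greedy actions to the greedy one.
        best = max(range(self.n), key=self.q.__getitem__)
        moved = 0.0
        for a in range(self.n):
            if a != best:
                step = min(self.pi[a], delta / (self.n - 1))
                self.pi[a] -= step
                moved += step
        self.pi[best] += moved
```

In a distributed setting, each device would hold its own instance, call `act()` before a random access attempt, and call `update()` with a locally observed reward; how that reward is derived from energy efficiency is defined by the paper's energy model and is not reproduced here.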


Pages: 6

50 items in total

[1] Reinforcement Learning for Real-Time Optimization in NB-IoT Networks
Jiang, Nan
Deng, Yansha
Nallanathan, Arumugam
Chambers, Jonathon A.
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2019, 37 (06) : 1424 - 1440
[2] Multi-agent Reinforcement Learning for Green Energy Powered IoT Networks with Random Access
Han, Mengqi
Del Castillo, Luis Arocas
Khairy, Sami
Chen, Xuehan
Cai, Lin X.
Lin, Bin
Hou, Fen
2020 IEEE 92ND VEHICULAR TECHNOLOGY CONFERENCE (VTC2020-FALL), 2020,
[3] Reinforcement-Learning-Based Optimization on Energy Efficiency in UAV Networks for IoT
Deng, Dan
Li, Junxia
Jhaveri, Rutvij H. H.
Tiwari, Prayag
Ijaz, Muhammad Fazal
Ou, Jiangtao
Fan, Chengyuan
IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (03) : 2767 - 2775
[4] COOPERATIVE DEEP REINFORCEMENT LEARNING FOR MULTIPLE-GROUP NB-IOT NETWORKS OPTIMIZATION
Jiang, Nan
Deng, Yansha
Simeone, Osvaldo
Nallanathan, Arumugam
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 8424 - 8428
[5] Visualizing Multi-Agent Reinforcement Learning for Robotic Communication in Industrial IoT Networks
Luo, Ruyu
Ni, Wanli
Tian, Hui
IEEE INFOCOM 2022 - IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS (INFOCOM WKSHPS), 2022,
[6] Access Control in NB-IoT Networks: A Deep Reinforcement Learning Strategy
Hadjadj-Aoul, Yassine
Ait-Chellouche, Soraya
INFORMATION, 2020, 11 (11) : 1 - 16
[7] Multi-Agent Reinforcement Learning for Resource Allocation in IoT Networks with Edge Computing
Liu, Xiaolan
Yu, Jiadong
Feng, Zhiyong
Gao, Yue
CHINA COMMUNICATIONS, 2020, 17 (09) : 220 - 236
[8] Deep Reinforcement Learning for NPDCCH Period Adjustment in NB-IoT Networks
Yu, Ya-Ju
Chuang, Ching-Chih
Cheng, Yu-Wei
2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 1883 - 1888
[9] Distributed localization for IoT with multi-agent reinforcement learning
Jia, Jie
Yu, Ruoying
Du, Zhenjun
Chen, Jian
Wang, Qinghu
Wang, Xingwei
NEURAL COMPUTING & APPLICATIONS, 2022, 34 (09) : 7227 - 7240
[10] Distributed localization for IoT with multi-agent reinforcement learning
Jie Jia
Ruoying Yu
Zhenjun Du
Jian Chen
Qinghu Wang
Xingwei Wang
Neural Computing and Applications, 2022, 34 : 7227 - 7240
