Toward Packet Routing With Fully Distributed Multiagent Deep Reinforcement Learning

被引：38

作者：

You, Xinyu ^{[1
]}

Li, Xuanjie ^{[1
]}

Xu, Yuedong ^{[1
]}

Feng, Hui ^{[1
]}

Zhao, Jin ^{[2
]}

Yan, Huaicheng ^{[3
,4
]}

机构：

[1] Fudan Univ, Sch Informat Sci & Technol, Shanghai 200237, Peoples R China

[2] Fudan Univ, Sch Comp Sci, Shanghai 200237, Peoples R China

[3] East China Univ Sci & Technol, Minist Educ, Key Lab Adv Control & Optimizat Chem Proc, Shanghai 200237, Peoples R China

[4] Hubei Normal Univ, Coll Mechatron & Control Engn, Huangshi 435002, Hubei, Peoples R China

来源：

IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS | 2022年 / 52卷 / 02期

关键词：

Routing; Training; Delays; Heuristic algorithms; Optimization; Biological neural networks; Prediction algorithms; Deep reinforcement learning (DRL); local communications; multiagent learning; packet routing; TRACKING CONTROL; SYSTEMS; TIME;

D O I：

10.1109/TSMC.2020.3012832

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Packet routing is one of the fundamental problems in computer networks in which a router determines the next-hop of each packet in the queue to get it as quickly as possible to its destination. Reinforcement learning (RL) has been introduced to design autonomous packet routing policies with local information of stochastic packet arrival and service. However, the curse of dimensionality of RL prohibits the more comprehensive representation of dynamic network states, thus limiting its potential benefit. In this article, we propose a novel packet routing framework based on multiagent deep RL (DRL) in which each router possess an independent long short term memory (LSTM) recurrent neural network (RNN) for training and decision making in a fully distributed environment. The LSTM RNN extracts routing features from rich information regarding backlogged packets and past actions, and effectively approximates the value function of Q-learning. We further allow each route to communicate periodically with direct neighbors so that a broader view of network state can be incorporated. The experimental results manifest that our multiagent DRL policy can strike the delicate balance between congestion-aware and shortest routes, and significantly reduce the packet delivery time in general network topologies compared with its counterparts.

引用

页码：855 / 868

页数：14

共 50 条

[1] Toward Packet Routing with Fully-distributed Multi-agent Deep Reinforcement Learning
You, Xinyu
Li, Xuanjie
Xu, Yuedong
Feng, Hui
Zhao, Jin
17TH INTERNATIONAL SYMPOSIUM ON MODELING AND OPTIMIZATION IN MOBILE, AD HOC, AND WIRELESS NETWORKS (WIOPT 2019), 2019, : 31 - 38
[2] Distributed Routing Optimization Algorithm for FANET Based on Multiagent Reinforcement Learning
Ke, Yaqi
Huang, Kai
Qiu, Xiulin
Song, Bo
Xu, Lei
Yin, Jun
Yang, Yuwang
IEEE SENSORS JOURNAL, 2024, 24 (15) : 24851 - 24864
[3] Toward Intelligent Multizone Thermal Control With Multiagent Deep Reinforcement Learning
Li, Jie
Zhang, Wei
Gao, Guanyu
Wen, Yonggang
Jin, Guangyu
Christopoulos, Georgios
IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (14) : 11150 - 11162
[4] A Deep Reinforcement Learning-Based Geographic Packet Routing Optimization
Bai, Yijie
Zhang, Xia
Yu, Daojie
Li, Shengxiang
Wang, Yu
Lei, Shuntian
Tian, Zhoutai
IEEE ACCESS, 2022, 10 : 108785 - 108796
[5] Interterminal Truck Routing Optimization Using Cooperative Multiagent Deep Reinforcement Learning
Adi, Taufik Nur
Bae, Hyerim
Iskandar, Yelita Anggiane
PROCESSES, 2021, 9 (10)
[6] Distributed Multiagent Deep Reinforcement Learning for Multiline Dynamic Bus Timetable Optimization
Yan, Haoyang
Cui, Zhiyong
Chen, Xinqiang
Ma, Xiaolei
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2023, 19 (01) : 469 - 479
[7] Distributed Neural Learning Algorithms for Multiagent Reinforcement Learning
Dai, Pengcheng
Liu, Hongzhe
Yu, Wenwu
Wang, He
IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (23) : 21039 - 21060
[8] CoDRL: Intelligent Packet Routing in SDN Using Convolutional Deep Reinforcement Learning
Swain, Pravati
Kamalia, Uttam
Bhandarkar, Raj
Modi, Tejas
13TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED NETWORKS AND TELECOMMUNICATION SYSTEMS (IEEE ANTS), 2019,
[9] Distributed Vehicle Tracking in Wireless Sensor Network: A Fully Decentralized Multiagent Reinforcement Learning Approach
Liang, Teng
Lin, Yan
Shi, Long
Li, Jun
Zhang, Yijin
Qian, Yuwen
IEEE SENSORS LETTERS, 2021, 5 (01) : 1 - 4
[10] A Multiagent Quantum Deep Reinforcement Learning Method for Distributed Frequency Control of Islanded Microgrids
Yan, Rudai
Wang, Yu
Xu, Yan
Dai, Jiahong
IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS, 2022, 9 (04): : 1622 - 1632

← 1 2 3 4 5 →