Toward Packet Routing With Fully Distributed Multiagent Deep Reinforcement Learning

Cited by: 38
Authors
You, Xinyu [1]
Li, Xuanjie [1]
Xu, Yuedong [1]
Feng, Hui [1]
Zhao, Jin [2]
Yan, Huaicheng [3, 4]
Affiliations
[1] Fudan Univ, Sch Informat Sci & Technol, Shanghai 200237, Peoples R China
[2] Fudan Univ, Sch Comp Sci, Shanghai 200237, Peoples R China
[3] East China Univ Sci & Technol, Minist Educ, Key Lab Adv Control & Optimizat Chem Proc, Shanghai 200237, Peoples R China
[4] Hubei Normal Univ, Coll Mechatron & Control Engn, Huangshi 435002, Hubei, Peoples R China
Keywords
Routing; Training; Delays; Heuristic algorithms; Optimization; Biological neural networks; Prediction algorithms; Deep reinforcement learning (DRL); local communications; multiagent learning; packet routing; TRACKING CONTROL; SYSTEMS; TIME
DOI
10.1109/TSMC.2020.3012832
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Packet routing is one of the fundamental problems in computer networks, in which a router determines the next hop of each packet in its queue to get it to its destination as quickly as possible. Reinforcement learning (RL) has been introduced to design autonomous packet routing policies with local information of stochastic packet arrival and service. However, the curse of dimensionality of RL prohibits a more comprehensive representation of dynamic network states, thus limiting its potential benefit. In this article, we propose a novel packet routing framework based on multiagent deep RL (DRL) in which each router possesses an independent long short-term memory (LSTM) recurrent neural network (RNN) for training and decision making in a fully distributed environment. The LSTM RNN extracts routing features from rich information regarding backlogged packets and past actions, and effectively approximates the value function of Q-learning. We further allow each router to communicate periodically with its direct neighbors so that a broader view of the network state can be incorporated. The experimental results show that our multiagent DRL policy can strike a delicate balance between congestion-aware and shortest routes, and significantly reduces the packet delivery time in general network topologies compared with its counterparts.
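Illustrative sketch (not part of the indexed record): the abstract describes each router learning its own next-hop policy from local information with Q-learning. The Python below is a minimal sketch of that idea under loud assumptions: it swaps the paper's LSTM value approximator for a plain lookup table, treats the Q-value as an estimated remaining delivery time to be minimized, and uses no discounting. The names RouterAgent, choose_next_hop, best_estimate, and update are hypothetical and do not come from the paper.

```python
import random
from collections import defaultdict


class RouterAgent:
    """One router's independent learner.

    Assumption: a tabular stand-in for the per-router LSTM in the abstract.
    q[dest][neighbor] estimates the remaining delivery time to `dest`
    when the packet is forwarded to `neighbor` (lower is better).
    """

    def __init__(self, neighbors, alpha=0.5, epsilon=0.1, seed=0):
        self.neighbors = list(neighbors)
        self.alpha = alpha        # learning rate
        self.epsilon = epsilon    # exploration probability
        self.rng = random.Random(seed)
        self.q = defaultdict(lambda: {n: 0.0 for n in self.neighbors})

    def choose_next_hop(self, dest):
        # Epsilon-greedy: explore occasionally, otherwise pick the
        # neighbor with the smallest estimated delivery time.
        if self.rng.random() < self.epsilon:
            return self.rng.choice(self.neighbors)
        estimates = self.q[dest]
        return min(estimates, key=estimates.get)

    def best_estimate(self, dest):
        # Value a neighbor would report back over a local link.
        return min(self.q[dest].values())

    def update(self, dest, next_hop, hop_delay, neighbor_estimate):
        # One-step Q-learning update toward the observed hop delay
        # (queuing plus transmission) plus the neighbor's own estimate.
        target = hop_delay + neighbor_estimate
        old = self.q[dest][next_hop]
        self.q[dest][next_hop] = old + self.alpha * (target - old)


# Usage: router A has neighbors B and C; the link via B is congested.
agent = RouterAgent(["B", "C"], alpha=0.5, epsilon=0.0)
agent.update("D", "B", hop_delay=5.0, neighbor_estimate=0.0)
agent.update("D", "C", hop_delay=1.0, neighbor_estimate=0.0)
chosen = agent.choose_next_hop("D")
```

With exploration switched off, the agent in this usage example prefers the neighbor whose observed delay was lower. In the framework described by the abstract, the table would be replaced by a neural network fed with backlogged-packet and past-action information.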
Pages: 855 - 868
Number of pages: 14
Related papers
50 in total
  • [1] Toward Packet Routing with Fully-distributed Multi-agent Deep Reinforcement Learning
    You, Xinyu
    Li, Xuanjie
    Xu, Yuedong
    Feng, Hui
    Zhao, Jin
    17TH INTERNATIONAL SYMPOSIUM ON MODELING AND OPTIMIZATION IN MOBILE, AD HOC, AND WIRELESS NETWORKS (WIOPT 2019), 2019, : 31 - 38
  • [2] Distributed Routing Optimization Algorithm for FANET Based on Multiagent Reinforcement Learning
    Ke, Yaqi
    Huang, Kai
    Qiu, Xiulin
    Song, Bo
    Xu, Lei
    Yin, Jun
    Yang, Yuwang
    IEEE SENSORS JOURNAL, 2024, 24 (15) : 24851 - 24864
  • [3] Toward Intelligent Multizone Thermal Control With Multiagent Deep Reinforcement Learning
    Li, Jie
    Zhang, Wei
    Gao, Guanyu
    Wen, Yonggang
    Jin, Guangyu
    Christopoulos, Georgios
    IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (14) : 11150 - 11162
  • [4] A Deep Reinforcement Learning-Based Geographic Packet Routing Optimization
    Bai, Yijie
    Zhang, Xia
    Yu, Daojie
    Li, Shengxiang
    Wang, Yu
    Lei, Shuntian
    Tian, Zhoutai
    IEEE ACCESS, 2022, 10 : 108785 - 108796
  • [5] Interterminal Truck Routing Optimization Using Cooperative Multiagent Deep Reinforcement Learning
    Adi, Taufik Nur
    Bae, Hyerim
    Iskandar, Yelita Anggiane
    PROCESSES, 2021, 9 (10)
  • [6] Distributed Multiagent Deep Reinforcement Learning for Multiline Dynamic Bus Timetable Optimization
    Yan, Haoyang
    Cui, Zhiyong
    Chen, Xinqiang
    Ma, Xiaolei
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2023, 19 (01) : 469 - 479
  • [7] Distributed Neural Learning Algorithms for Multiagent Reinforcement Learning
    Dai, Pengcheng
    Liu, Hongzhe
    Yu, Wenwu
    Wang, He
    IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (23) : 21039 - 21060
  • [8] CoDRL: Intelligent Packet Routing in SDN Using Convolutional Deep Reinforcement Learning
    Swain, Pravati
    Kamalia, Uttam
    Bhandarkar, Raj
    Modi, Tejas
    13TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED NETWORKS AND TELECOMMUNICATION SYSTEMS (IEEE ANTS), 2019
  • [9] Distributed Vehicle Tracking in Wireless Sensor Network: A Fully Decentralized Multiagent Reinforcement Learning Approach
    Liang, Teng
    Lin, Yan
    Shi, Long
    Li, Jun
    Zhang, Yijin
    Qian, Yuwen
    IEEE SENSORS LETTERS, 2021, 5 (01) : 1 - 4
  • [10] A Multiagent Quantum Deep Reinforcement Learning Method for Distributed Frequency Control of Islanded Microgrids
    Yan, Rudai
    Wang, Yu
    Xu, Yan
    Dai, Jiahong
    IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS, 2022, 9 (04) : 1622 - 1632