A Lightweight Reinforcement Learning Based Packet Routing Method Using Online Sequential Learning

Cited by: 1

Authors:

Nemoto, Kenji [1]

Matsutani, Hiroki [1]

Affiliations:

[1] Keio Univ, Grad Sch Sci & Technol, Yokohama 2238522, Japan

Source:

IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS | 2023 / Vol. E106D / No. 11

Keywords:

reinforcement learning; packet routing; neural networks; OS-ELM

DOI:

10.1587/transinf.2022EDP7231

Chinese Library Classification number:

TP [Automation technology, computer technology]

Discipline classification code:

0812

Abstract:

Existing simple routing protocols (e.g., OSPF, RIP) are inflexible and prone to congestion because packets concentrate on particular routers. To address these issues, packet routing methods using machine learning have recently been proposed. Compared to the conventional protocols, machine learning based methods can choose a routing path intelligently by learning efficient routes. However, they incur a training time overhead. We thus focus on a lightweight machine learning algorithm, OS-ELM (Online Sequential Extreme Learning Machine), to reduce the training time. Although previous work on reinforcement learning using OS-ELM exists, it suffers from low learning accuracy. In this paper, we propose OS-ELM QN (Q-Network) with a prioritized experience replay buffer to improve the learning performance. It is compared to a deep reinforcement learning based packet routing method using a network simulator. Experimental results show that introducing the experience replay buffer improves the learning performance. OS-ELM QN achieves a 2.33 times speedup over a DQN (Deep Q-Network) in terms of learning speed. Regarding packet transfer latency, OS-ELM QN is comparable or slightly inferior to the DQN, while both are better than OSPF in most cases because they can distribute congestion.
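The abstract names a prioritized experience replay buffer but does not describe its design. As a minimal sketch, assuming the generic proportional variant (sampling probability proportional to priority raised to `alpha`, with the priority set from the absolute TD error), such a buffer could look like the following. The class name, the `alpha` and `eps` parameters, and the `Transition` fields are illustrative assumptions and are not taken from the paper; importance-sampling weights are omitted for brevity.

```python
import random
from collections import namedtuple

# Hypothetical transition layout; the paper's actual state/action encoding is not given here.
Transition = namedtuple("Transition", "state action reward next_state done")


class PrioritizedReplayBuffer:
    """Fixed-capacity buffer with proportional prioritized sampling (illustrative sketch)."""

    def __init__(self, capacity, alpha=0.6, eps=1e-3, seed=None):
        self.capacity = capacity
        self.alpha = alpha  # 0 gives uniform sampling, 1 gives fully proportional sampling
        self.eps = eps      # keeps every priority strictly positive
        self.data = []
        self.priorities = []
        self.pos = 0        # next slot to overwrite once the buffer is full
        self.rng = random.Random(seed)

    def __len__(self):
        return len(self.data)

    def add(self, transition):
        # New transitions receive the current maximum priority,
        # so they are likely to be sampled soon after insertion.
        p = max(self.priorities, default=1.0)
        if len(self.data) < self.capacity:
            self.data.append(transition)
            self.priorities.append(p)
        else:
            # Buffer is full: overwrite the oldest entry.
            self.data[self.pos] = transition
            self.priorities[self.pos] = p
        self.pos = (self.pos + 1) % self.capacity

    def sample(self, batch_size):
        # Draw indices with probability proportional to priority ** alpha.
        weights = [p ** self.alpha for p in self.priorities]
        idx = self.rng.choices(range(len(self.data)), weights=weights, k=batch_size)
        return idx, [self.data[i] for i in idx]

    def update_priorities(self, indices, td_errors):
        # After a learning step, re-prioritize the sampled transitions by |TD error|.
        for i, err in zip(indices, td_errors):
            self.priorities[i] = abs(err) + self.eps
```

In a Q-network training loop, one would typically call `add` after each routing decision, `sample` to draw a mini-batch, and `update_priorities` with the TD errors computed for that batch.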


Pages: 1796-1807

Page count: 12

