A Lightweight Reinforcement Learning Based Packet Routing Method Using Online Sequential Learning

被引:1
|
作者
Nemoto, Kenji [1 ]
Matsutani, Hiroki [1 ]
机构
[1] Keio Univ, Grad Sch Sci & Technol, Yokohama 2238522, Japan
关键词
reinforcement learning; packet routing; neural networks; OS-ELM;
D O I
10.1587/transinf.2022EDP7231
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Existing simple routing protocols (e.g., OSPF, RIP) have some disadvantages of being inflexible and prone to congestion due to the concentration of packets on particular routers. To address these issues, packet routing methods using machine learning have been proposed recently. Compared to these algorithms, machine learning based methods can choose a routing path intelligently by learning efficient routes. However, machine learning based methods have a disadvantage of training time overhead. We thus focus on a lightweight machine learning algorithm, OS-ELM (Online Sequential Extreme Learning Machine), to reduce the training time. Although previous work on reinforcement learning using OS-ELM exists, it has a problem of low learning accuracy. In this paper, we propose OS-ELM QN (Q-Network) with a prioritized experience replay buffer to improve the learning performance. It is compared to a deep reinforcement learning based packet routing method using a network simulator. Experimental results show that introducing the experience replay buffer improves the learning performance. OS-ELM QN achieves a 2.33 times speedup than a DQN (Deep Q-Network) in terms of learning speed. Regarding the packet transfer latency, OS-ELM QN is comparable or slightly inferior to the DQN while they are better than OSPF in most cases since they can distribute congestions.
引用
收藏
页码:1796 / 1807
页数:12
相关论文
共 50 条
  • [1] A Packet Routing using Lightweight Reinforcement Learning Based on Online Sequential Learning
    Nemoto, Kenji
    Matsutani, Hiroki
    2022 TENTH INTERNATIONAL SYMPOSIUM ON COMPUTING AND NETWORKING WORKSHOPS, CANDARW, 2022, : 76 - 82
  • [2] Packet Routing Method for Multi-Stage Networks Based on Reinforcement Learning
    Gao Y.
    Luo L.
    Sun G.
    Dianzi Keji Daxue Xuebao/Journal of the University of Electronic Science and Technology of China, 2022, 51 (02): : 200 - 206
  • [3] An FPGA-Based On-Device Reinforcement Learning Approach using Online Sequential Learning
    Watanabe, Hirohisa
    Tsukada, Mineto
    Matsutani, Hiroki
    2021 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), 2021, : 96 - 103
  • [4] Reinforcement learning-based load shared sequential routing
    Heidari, Fariba
    Manner, Shie
    Mason, Lorne G.
    NETWORKING 2007: AD HOC AND SENSOR NETWORKS, WIRELESS NETWORKS, NEXT GENERATION INTERNET, PROCEEDINGS, 2007, 4479 : 832 - +
  • [5] A Deep Reinforcement Learning-Based Geographic Packet Routing Optimization
    Bai, Yijie
    Zhang, Xia
    Yu, Daojie
    Li, Shengxiang
    Wang, Yu
    Lei, Shuntian
    Tian, Zhoutai
    IEEE ACCESS, 2022, 10 : 108785 - 108796
  • [6] CoDRL: Intelligent Packet Routing in SDN Using Convolutional Deep Reinforcement Learning
    Swain, Pravati
    Kamalia, Uttam
    Bhandarkar, Raj
    Modi, Tejas
    13TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED NETWORKS AND TELECOMMUNICATION SYSTEMS (IEEE ANTS), 2019,
  • [7] Dynamic Packet Routing Algorithm Based on Multidimensional Information and Multiagent Reinforcement Learning
    Zhang, Linliang
    Du, Ruifang
    Hao, Zhiqiang
    Li, Shuo
    Hu, Zhiguo
    INTERNATIONAL JOURNAL OF COMMUNICATION SYSTEMS, 2025, 38 (06)
  • [8] Online mobile learning resource recommendation method based on deep reinforcement learning
    Li, Pingyang
    Zhang, Juan
    INTERNATIONAL JOURNAL OF INNOVATION AND SUSTAINABLE DEVELOPMENT, 2025, 19 (01)
  • [9] An online feature learning algorithm using HCI-based reinforcement learning
    Liu, F
    Su, JB
    ADVANCES IN NEURAL NETWORKS - ISNN 2004, PT 1, 2004, 3173 : 293 - 298
  • [10] AN ONLINE CROWD SEMANTIC SEGMENTATION METHOD BASED ON REINFORCEMENT LEARNING
    Cheng, Yu
    Yang, Hua
    Chen, Lin
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 2429 - 2433