Multiagent Deep Reinforcement Learning for Cost- and Delay-Sensitive Virtual Network Function Placement and Routing

被引:14
|
作者
Wang, Shaoyang [1 ]
Yuen, Chau [2 ]
Ni, Wei [3 ]
Guan, Yong Liang [4 ]
Lv, Tiejun [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Informat & Commun Engn, Beijing 100876, Peoples R China
[2] SUTD, Engn Prod Dev, Somapah Rd, Singapore 487372, Singapore
[3] Commonwealth Sci & Ind Res Org CSIRO, Sydney, NSW 2122, Australia
[4] Nanyang Technol Univ, Sch Elect & Elect Engn, Nanyang Ave, Singapore 639798, Singapore
关键词
Routing; Costs; Delays; Optimization; Network topology; Topology; Minimization; Placement and routing; multi-agent deep reinforcement learning; virtual network functions; VNF PLACEMENT; AWARE; ALLOCATION; DEPLOYMENT;
D O I
10.1109/TCOMM.2022.3187146
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper proposes an effective and novel multi-agent deep reinforcement learning (MADRL)-based method for solving the joint virtual network function (VNF) placement and routing (P&R), where multiple service requests with differentiated demands are delivered at the same time. The differentiated demands of the service requests are reflected by their delay- and cost-sensitive factors. We first construct a VNF P&R problem to jointly minimize a weighted sum of service delay and resource consumption cost, which is NP-complete. Then, the joint VNF P&R problem is decoupled into two iterative subtasks: placement subtask and routing subtask. Each subtask consists of multiple concurrent parallel sequential decision processes. By invoking the deep deterministic policy gradient method and multi-agent technique, an MADRL-P&R framework is designed to perform the two subtasks. The new joint reward and internal rewards mechanism is proposed to match the goals and constraints of the placement and routing subtasks. We also propose the parameter migration-based model-retraining method to deal with changing network topologies. Corroborated by experiments, the proposed MADRL-P&R framework is superior to its alternatives in terms of service cost and delay, and offers higher flexibility for personalized service demands. The parameter migration-based model-retraining method can efficiently accelerate convergence under moderate network topology changes.
引用
收藏
页码:5208 / 5224
页数:17
相关论文
共 50 条
  • [31] Reinforcement Learning for Optimizing Delay-Sensitive Task Offloading in Vehicular Edge-Cloud Computing
    Binh, Ta Huu
    Son, Do Bao
    Vo, Hiep
    Nguyen, Binh Minh
    Binh, Huynh Thi Thanh
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (02): : 2058 - 2069
  • [32] Service-Aware Virtual Network Function Migration Based on Deep Reinforcement Learning
    Li, Zeming
    Liu, Ziyu
    Liang, Chengchao
    Liu, Zhanjun
    [J]. COMMUNICATIONS AND NETWORKING (CHINACOM 2021), 2022, : 481 - 496
  • [33] Joint Virtual Network Function Deployment and Scheduling via Heuristics and Deep Reinforcement Learning
    Zhang, Zixiao
    Oki, Eiji
    [J]. IEICE TRANSACTIONS ON COMMUNICATIONS, 2023, E106B (12) : 1424 - 1440
  • [34] Virtual network function deployment algorithm based on graph convolution deep reinforcement learning
    Qiu, Rixuan
    Bao, Jiawen
    Li, Yuancheng
    Zhou, Xin
    Liang, Liang
    Tian, Hui
    Zeng, Yanting
    Shi, Jie
    [J]. JOURNAL OF SUPERCOMPUTING, 2023, 79 (06): : 6849 - 6870
  • [35] Virtual network function deployment algorithm based on graph convolution deep reinforcement learning
    Rixuan Qiu
    Jiawen Bao
    Yuancheng Li
    Xin Zhou
    Liang Liang
    Hui Tian
    Yanting Zeng
    Jie Shi
    [J]. The Journal of Supercomputing, 2023, 79 : 6849 - 6870
  • [36] Accelerated Structure-Aware Reinforcement Learning for Delay-Sensitive Energy Harvesting Wireless Sensors
    Sharma, Nikhilesh
    Mastronarde, Nicholas
    Chakareski, Jacob
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2020, 68 : 1409 - 1424
  • [37] LOW-COMPLEXITY REINFORCEMENT LEARNING FOR DELAY-SENSITIVE COMPRESSION IN NETWORKED VIDEO STREAM MINING
    Zhu, Xiaoqing
    Lan, Cuiling
    van der Schaar, Mihaela
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME 2013), 2013,
  • [38] Deep Reinforcement Learning for Controller Placement in Software Defined Network
    Wu, Yiwen
    Zhou, Sipei
    Wei, Yunkai
    Leng, Supeng
    [J]. IEEE INFOCOM 2020 - IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS (INFOCOM WKSHPS), 2020, : 1254 - 1259
  • [39] Network routing optimization approach based on deep reinforcement learning
    Meng L.
    Guo B.
    Yang W.
    Zhang X.
    Zhao Z.
    Huang S.
    [J]. Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2022, 44 (07): : 2311 - 2318
  • [40] Deep Reinforcement Learning for Resource Demand Prediction and Virtual Function Network Migration in Digital Twin Network
    Liu, Qinghai
    Tang, Lun
    Wu, Ting
    Chen, Qianbin
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (21) : 19102 - 19116