Multiagent Deep Reinforcement Learning for Cost- and Delay-Sensitive Virtual Network Function Placement and Routing

被引:14
|
作者
Wang, Shaoyang [1 ]
Yuen, Chau [2 ]
Ni, Wei [3 ]
Guan, Yong Liang [4 ]
Lv, Tiejun [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Informat & Commun Engn, Beijing 100876, Peoples R China
[2] SUTD, Engn Prod Dev, Somapah Rd, Singapore 487372, Singapore
[3] Commonwealth Sci & Ind Res Org CSIRO, Sydney, NSW 2122, Australia
[4] Nanyang Technol Univ, Sch Elect & Elect Engn, Nanyang Ave, Singapore 639798, Singapore
关键词
Routing; Costs; Delays; Optimization; Network topology; Topology; Minimization; Placement and routing; multi-agent deep reinforcement learning; virtual network functions; VNF PLACEMENT; AWARE; ALLOCATION; DEPLOYMENT;
D O I
10.1109/TCOMM.2022.3187146
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper proposes an effective and novel multi-agent deep reinforcement learning (MADRL)-based method for solving the joint virtual network function (VNF) placement and routing (P&R), where multiple service requests with differentiated demands are delivered at the same time. The differentiated demands of the service requests are reflected by their delay- and cost-sensitive factors. We first construct a VNF P&R problem to jointly minimize a weighted sum of service delay and resource consumption cost, which is NP-complete. Then, the joint VNF P&R problem is decoupled into two iterative subtasks: placement subtask and routing subtask. Each subtask consists of multiple concurrent parallel sequential decision processes. By invoking the deep deterministic policy gradient method and multi-agent technique, an MADRL-P&R framework is designed to perform the two subtasks. The new joint reward and internal rewards mechanism is proposed to match the goals and constraints of the placement and routing subtasks. We also propose the parameter migration-based model-retraining method to deal with changing network topologies. Corroborated by experiments, the proposed MADRL-P&R framework is superior to its alternatives in terms of service cost and delay, and offers higher flexibility for personalized service demands. The parameter migration-based model-retraining method can efficiently accelerate convergence under moderate network topology changes.
引用
收藏
页码:5208 / 5224
页数:17
相关论文
共 50 条
  • [1] Delay Sensitive Virtual Network Function Placement and Routing
    Gouareb, Racha
    Friderikos, Vasilis
    Aghvami, A. Hamid
    [J]. 2018 25TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS (ICT), 2018, : 394 - 398
  • [2] Deep Reinforcement Learning Based Dynamic Routing Optimization for Delay-Sensitive Applications
    Chen, Jiawei
    Xiao, Yang
    Lin, Guocheng
    He, Gang
    Liu, Fang
    Zhou, Wenli
    Liu, Jun
    [J]. IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 5208 - 5213
  • [3] Leveraging Deep Reinforcement Learning With Attention Mechanism for Virtual Network Function Placement and Routing
    He, Nan
    Yang, Song
    Li, Fan
    Trajanovski, Stojan
    Zhu, Liehuang
    Wang, Yu
    Fu, Xiaoming
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2023, 34 (04) : 1186 - 1201
  • [4] Virtual Network Function Placement Optimization With Deep Reinforcement Learning
    Solozabal, Ruben
    Ceberio, Josu
    Sanchoyerto, Aitor
    Zabala, Luis
    Blanco, Bego
    Liberal, Fidel
    [J]. IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2020, 38 (02) : 292 - 303
  • [5] Deep Reinforcement Learning for Delay-Sensitive LTE Downlink Scheduling
    Sharma, Nikhilesh
    Zhang, Sen
    Venkata, Someshwar Rao Somayajula
    Malandra, Filippo
    Mastronarde, Nicholas
    Chakareski, Jacob
    [J]. 2020 IEEE 31ST ANNUAL INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR AND MOBILE RADIO COMMUNICATIONS (IEEE PIMRC), 2020,
  • [6] JOSP: Joint Optimization of Flow Path Scheduling and Virtual Network Function Placement for Delay-Sensitive Applications
    Qing Lyu
    Yonghang Zhou
    Qilin Fan
    Yongqiang Lyu
    Xi Zheng
    Guangquan Xu
    Jun Li
    [J]. Mobile Networks and Applications, 2022, 27 : 1642 - 1658
  • [7] JOSP: Joint Optimization of Flow Path Scheduling and Virtual Network Function Placement for Delay-Sensitive Applications
    Lyu, Qing
    Zhou, Yonghang
    Fan, Qilin
    Lyu, Yongqiang
    Zheng, Xi
    Xu, Guangquan
    Li, Jun
    [J]. MOBILE NETWORKS & APPLICATIONS, 2022, 27 (04): : 1642 - 1658
  • [8] Delay-Sensitive Energy-Efficient UAV Crowdsensing by Deep Reinforcement Learning
    Dai, Zipeng
    Liu, Chi Harold
    Han, Rui
    Wang, Guoren
    Leung, Kin K. K.
    Tang, Jian
    [J]. IEEE TRANSACTIONS ON MOBILE COMPUTING, 2023, 22 (04) : 2038 - 2052
  • [9] Virtual Network Function Placement Optimization Algorithm Based on Improve Deep Reinforcement Learning
    Tang Lun
    He Lanqin
    Lian Qinyi
    Tan Qi
    [J]. JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2021, 43 (06) : 1724 - 1732
  • [10] Delay-Sensitive and Availability-Aware Virtual Network Function Scheduling for NFV
    Yang, Song
    Li, Fan
    Yahyapour, Ramin
    Fu, Xiaoming
    [J]. IEEE TRANSACTIONS ON SERVICES COMPUTING, 2022, 15 (01) : 188 - 201