Adaptive service function chaining mappings in 5G using deep Q-learning

被引:25
|
作者
Li, Guanglei [1 ]
Feng, Bohao [1 ]
Zhou, Huachun [1 ]
Zhang, Yuming [1 ]
Sood, Keshav [2 ]
Yu, Shui [3 ]
机构
[1] Beijing Jiaotong Univ, Inst Elect & Informat Engn, Beijing 100044, Peoples R China
[2] Deakin Univ, Sch Informat Technol, Melbourne, Vic 3125, Australia
[3] Univ Technol Sydney, Sch Comp Sci, Sydney, NSW 2007, Australia
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Resource allocation; Service function chaining; Network function virtualization; Deep reinforcement learning; FUNCTION PLACEMENT; REINFORCEMENT; ORCHESTRATION; OPTIMIZATION; ARCHITECTURE; NETWORKING; NFV;
D O I
10.1016/j.comcom.2020.01.035
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With introduction of Software-Defined Networking (SDN) and Network Functions Virtualization (NFV) technologies, mobile network operators are able to provide on-demand Service Function Chaining (SFC) to meet various needs from users. However, it is challenging to map multiple SFCs to substrate networks efficiently, particularly in a number of key scenarios of forthcoming 5G, where user requests have different priorities and various resource demands. To this end, we first formulate the mapping of multiple SFCs with priorities as a multi-step Linear Integer Programming (ILP) problem, of which the mapping strategy (i.e., the objective function) in each step is configurable to improve overall CPU and bandwidth resource utilization rates. Secondly, to solve the strategy selection problem in each step and alleviate the complexity of ILP, we propose an adaptive deep Q-learning based SFC mapping approach (ADAP), where an agent is learned to make decisions from two low-complexity heuristic SFC mapping algorithms. Finally, we conduct extensive simulations using multiple SFC requests with randomly generated CPU and bandwidth demands in a real-world substrate network topology. Related results demonstrate that compared with a single strategy or random selections of strategies under the ILP-based approach or the proposed heuristic algorithms, our ADAP approach can improve whole-system resource efficiency by scheduling this two simply designed heuristic algorithms properly after limited training episodes.
引用
收藏
页码:305 / 315
页数:11
相关论文
共 50 条
  • [1] Adaptive Ant Colony Optimization for Service Function Chaining in a Dynamic 5G Network
    Moreno, Segundo
    Mora, Antonio M.
    [J]. ADVANCES IN COMPUTATIONAL INTELLIGENCE, IWANN 2021, PT I, 2021, 12861 : 151 - 164
  • [2] Distributed Q-Learning Approach for Adaptive Sleep Modes in 5G Networks
    El-Amine, Ali
    Iturralde, Mauricio
    Hassan, Hussein Al Haj
    Nuaymi, Loutfi
    [J]. 2019 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2019,
  • [3] Enhancing Quality of Experience of 5G Users Exploiting Deep Q-Learning
    Chaity, Rusmita Halim
    Roy, Palash
    Razzaque, Md Abdur
    Sadiquzzaman, Md
    [J]. 2021 3RD INTERNATIONAL CONFERENCE ON SUSTAINABLE TECHNOLOGIES FOR INDUSTRY 4.0 (STI), 2021,
  • [4] Enabling Machine Learning with Service Function Chaining for Security Enhancement at 5G Edges
    Feng, Bohao
    Zhou, Huachun
    Li, Guanglei
    Zhang, Yuming
    Sood, Keshav
    Yu, Shui
    [J]. IEEE NETWORK, 2021, 35 (05): : 196 - 201
  • [5] Advanced Conditional Handover in 5G and Beyond using Q-Learning
    Sundararaju, Sathia Chandrane
    Ramamoorthy, Shrinath
    Basavaraj, Dandra Prasad
    Phanindhar, Vanama
    [J]. 2024 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE, WCNC 2024, 2024,
  • [6] A Q-learning strategy for federation of 5G services
    Antevski, Kiril
    Martin-Perez, Jorge
    Garcia-Saavedra, Andres
    Bernardos, Carlos J.
    Li, Xi
    Baranda, Jorge
    Mangues-Bafalluy, Josep
    Martnez, Ricardo
    Vettori, Luca
    [J]. ICC 2020 - 2020 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2020,
  • [7] Q-Learning based Link Adaptation in 5G
    Wu, Shangbin
    Tsoukaneri, Galini
    Mouhouche, Belkacem
    [J]. 2020 IEEE 31ST ANNUAL INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR AND MOBILE RADIO COMMUNICATIONS (IEEE PIMRC), 2020,
  • [8] Q-learning based Service Function Chaining using VNF Resource-aware Reward Model
    Lee, Doyoung
    Yoo, Jae-Hyoung
    Hong, James Won-Ki
    [J]. APNOMS 2020: 2020 21ST ASIA-PACIFIC NETWORK OPERATIONS AND MANAGEMENT SYMPOSIUM (APNOMS), 2020, : 279 - 282
  • [9] Adaptive and Dynamic Service Composition Using Q-Learning
    Wang, Hongbing
    Zhou, Xuan
    Zhou, Xiang
    Liu, Weihong
    Li, Wenya
    [J]. 22ND INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2010), PROCEEDINGS, VOL 1, 2010,
  • [10] Service Function Chain Reconfiguration in 5G Core Networks Using Deep Learning
    Setayesh, Mehdi
    Wong, Vincent W. S.
    [J]. 2021 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2021,