Joint Multi-Objective Optimization for Radio Access Network Slicing Using Multi-Agent Deep Reinforcement Learning

被引:10
|
作者
Zhou, Guorong [1 ]
Zhao, Liqiang [1 ,2 ]
Zheng, Gan [3 ]
Xie, Zhijie [4 ]
Song, Shenghui [4 ]
Chen, Kwang-Cheng [5 ]
机构
[1] Xidian Univ, State Key Lab Integrated Serv Networks, Xian 710071, Peoples R China
[2] Xidian Univ, Guangzhou Inst Technol, Guangzhou 510100, Peoples R China
[3] Univ Warwick, Sch Engn, Coventry CV4 7AL, W Midlands, England
[4] Hong Kong Univ Sci & Technol, Dept Elect & Comp Engn, Hong Kong, Peoples R China
[5] Univ S Florida, Dept Elect Engn, Tampa, FL 33620 USA
基金
国家重点研发计划;
关键词
Radio access network slicing; multi-objective optimization; non-scalarization; multi-agent deep reinforcement learning; rank voting method; RESOURCE-ALLOCATION; SOFTWARIZATION; ALGORITHMS; MANAGEMENT; EFFICIENCY; STRATEGY; METRICS; SLICES; GAME; 6G;
D O I
10.1109/TVT.2023.3268671
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Radio access network (RAN) slices can provide various customized services for next-generation wireless networks. Thus, multiple performance metrics of different types of RAN slices need to be jointly optimized. However, existing efforts in multi-objective optimization problem (MOOP) for RAN slicing are only in the scalar form, which is difficult to achieve simultaneous optimization. In this paper, we consider a non-scalar MOOP for RAN slicing with three types of slices, i.e., the high-bandwidth slice, the low-delay slice, and the wide-coverage slice over the same underlying physical network. We jointly optimize the throughput, the transmission delay, and the coverage area by user-oriented dynamic virtual base stations (vBSs)' deployment, and sub-channel and power allocation. An improved multi-agent deep deterministic policy gradient (IMADDPG) algorithm, having the characteristics of centralized training and distributed execution, is proposed to solve the above non-deterministic polynomial-time hard (NP-hard) problem. The rank voting method is introduced in the inference process to obtain near-Pareto optimal solutions. Simulation results verify that the proposed scheme can ensure better performance than the traditional scalar utility method and other benchmark algorithms. The proposed scheme has the advantage of flexibly approaching any point of the Pareto boundary, while the traditional scalar method only subjectively approaches one of the Pareto optimal solutions. Furthermore, our proposal strikes a compelling tradeoff among three types of RAN slices due to the non-dominance between Pareto optimal solutions.
引用
收藏
页码:11828 / 11843
页数:16
相关论文
共 50 条
  • [1] Multi-objective optimization of the textile manufacturing process using deep-Q-network based multi-agent reinforcement learning
    He, Zhenglei
    Thomassey, Sebastien
    Zeng, Xianyi
    Xu, Jie
    Yi, Changhai
    [J]. JOURNAL OF MANUFACTURING SYSTEMS, 2022, 62 : 939 - 949
  • [2] Multi-Agent Deep Reinforcement Learning for Resource Allocation in the Multi-Objective HetNet
    Nie, Hongrui
    Li, Shaosheng
    Liu, Yong
    [J]. IWCMC 2021: 2021 17TH INTERNATIONAL WIRELESS COMMUNICATIONS & MOBILE COMPUTING CONFERENCE (IWCMC), 2021, : 116 - 121
  • [3] Multi-Agent Deep Reinforcement Learning Joint Beamforming for Slicing Resource Allocation
    Yan, Dandan
    Ng, Benjamin K.
    Ke, Wei
    Lam, Chan-Tong
    [J]. IEEE WIRELESS COMMUNICATIONS LETTERS, 2024, 13 (05) : 1220 - 1224
  • [4] Multi-objective optimization of truss structure using multi-agent reinforcement learning and graph representation
    Kupwiwat, Chi-tathon
    Hayashi, Kazuki
    Ohsaki, Makoto
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 129
  • [5] Multi-Objective Workflow Scheduling With Deep-Q-Network-Based Multi-Agent Reinforcement Learning
    Wang, Yuandou
    Liu, Hang
    Zheng, Wanbo
    Xia, Yunni
    Li, Yawen
    Chen, Peng
    Guo, Kunyin
    Xie, Hong
    [J]. IEEE ACCESS, 2019, 7 : 39974 - 39982
  • [6] Network slicing for vehicular communications: a multi-agent deep reinforcement learning approach
    Mlika, Zoubeir
    Cherkaoui, Soumaya
    [J]. ANNALS OF TELECOMMUNICATIONS, 2021, 76 (9-10) : 665 - 683
  • [7] Network slicing for vehicular communications: a multi-agent deep reinforcement learning approach
    Zoubeir Mlika
    Soumaya Cherkaoui
    [J]. Annals of Telecommunications, 2021, 76 : 665 - 683
  • [8] A multi-objective multi-agent deep reinforcement learning approach to residential appliance scheduling
    Lu, Junlin
    Mannion, Patrick
    Mason, Karl
    [J]. IET SMART GRID, 2022, 5 (04) : 260 - 280
  • [9] Multi-Objective Dynamic Dispatch Optimisation using Multi-Agent Reinforcement Learning
    Mannion, Patrick
    Mason, Karl
    Devlin, Sam
    Duggan, Jim
    Howley, Enda
    [J]. AAMAS'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS, 2016, : 1345 - 1346
  • [10] Multi-objective optimization of turbine blade profiles based on multi-agent reinforcement learning
    Li, Lele
    Zhang, Weihao
    Li, Ya
    Jiang, Chiju
    Wang, Yufan
    [J]. ENERGY CONVERSION AND MANAGEMENT, 2023, 297