A Novel Adaptive Resource Allocation Model Based on SMDP and Reinforcement Learning Algorithm in Vehicular Cloud System

被引:45
|
作者
Liang, Hongbin [1 ,2 ]
Zhang, Xiaohui [1 ,2 ,3 ]
Zhang, Jin [1 ,2 ]
Li, Qizhen [3 ]
Zhou, Shuya [1 ,2 ]
Zhao, Lian [4 ]
机构
[1] Southwest Jiaotong Univ, Natl United Engn Lab Integrated & Intelligent Tra, Sch Transportat & Logist, Chengdu 611756, Sichuan, Peoples R China
[2] Southwest Jiaotong Univ, Natl Engn Lab Integrated Transportat Big Data Ap, Chengdu 611756, Sichuan, Peoples R China
[3] Southwest Jiaotong Univ, Sch Informat Sci & Technol, Chengdu 611756, Sichuan, Peoples R China
[4] Ryerson Univ, Dept Elect Comp & Biomed Engn, Toronto, ON M5B 2K3, Canada
基金
中国国家自然科学基金;
关键词
Cloud computing; Resource management; Adaptation models; Adaptive systems; Quality of service; Quality of experience; Computational modeling; Semi-Markov Decision Process (SMDP); Reinforcement Learning (RL) Algorithm; Vehicular Cloud System; Neural-Network; Quality of Experience (QoE); Quality of Service (QoS); ASSIGNMENT; COMMUNICATION; OPTIMIZATION;
D O I
10.1109/TVT.2019.2937842
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we propose a novel adaptive cloud resource allocation model based on Semi-Markov Decision Process (SMDP) and Reinforcement Learning (RL) algorithm in vehicular cloud system. The issue of adaptive resource allocation for vehicular request is formed as an SMDP in order to gain the dynamics of vehicular requests arrival and departure. An optimized decision is made to guarantee the Quality of Service (QoS) of the vehicular cloud system and the Quality of Experience (QoE) of the vehicular users as well as to maximize the total system reward of the vehicular cloud system in consideration of the balance between the vehicular cloud resource expense and the system income. Furthermore, to capture the mobility feature of the vehicular cloud system, we also apply a neural-network-based RL algorithm to resolve our proposed SMDP-based adaptive cloud resource allocation model. Firstly, we use a Planning algorithm to get the action values under certain state-action pairs, which are the initial samples to train the neural network. Then the RL is used to update the parameters of the neural network, train the neural network and adaptively improve the decision strategy. Subsequently, an adaptive vehicular cloud resource allocation scheme which can approach the optimal strategy is obtained without the knowledge of the distribution function of vehicular requests arrival and departure during the RL process. The simulation results show that our proposed adaptive cloud resource allocation model for vehicular cloud system can reduce the probability of delay in processing requests and achieve high system rewards in comparison with the regularly used greedy resource allocation method. The performance of the RL solution approaches that of traditional value iteration solution for our proposed adaptive cloud resource allocation model.
引用
收藏
页码:10018 / 10029
页数:12
相关论文
共 50 条
  • [11] Content Driven and Reinforcement Learning Based Resource Allocation Scheme in Vehicular Network
    Chen, Jiujiu
    Guo, Caili
    Feng, Chunyan
    Zhu, Meiyi
    Sun, Qizheng
    [J]. IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2021), 2021,
  • [12] Adaptive Resource Allocation for Anti-money Laundering Based on SMDP
    Hong, Xintao
    Liang, Hongbin
    Gao, Zengan
    [J]. WIRELESS ALGORITHMS, SYSTEMS, AND APPLICATIONS, 2015, 9204 : 190 - 200
  • [13] A Dynamic Resource Allocation Model Based on SMDP and DRL Algorithm for Truck Platoon in Vehicle Network
    Liang, Hongbin
    Zhou, Shuya
    Liu, Xiaobo
    Zheng, Fangfang
    Hong, Xintao
    Zhou, Xuemei
    Zhao, Lian
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (12) : 10295 - 10305
  • [14] Resource allocation algorithm for MEC based on Deep Reinforcement Learning
    Wang, Yijie
    Chen, Xin
    Chen, Ying
    Du, Shougang
    [J]. 2021 IEEE INTERNATIONAL PERFORMANCE, COMPUTING, AND COMMUNICATIONS CONFERENCE (IPCCC), 2021,
  • [15] Deep reinforcement learning-based joint optimization model for vehicular task offloading and resource allocation
    Li, Zhi-Yuan
    Zhang, Zeng-Xiang
    [J]. PEER-TO-PEER NETWORKING AND APPLICATIONS, 2024, 17 (04) : 2001 - 2015
  • [16] A Reinforcement Learning-Based Resource Allocation Scheme for Cloud Robotics
    Liu, Hang
    Liu, Shiwen
    Zheng, Kan
    [J]. IEEE ACCESS, 2018, 6 : 17215 - 17222
  • [17] FRAC: a flexible resource allocation for vehicular cloud system
    Pradhan, Srikanta
    Tripathy, Somanath
    [J]. IET INTELLIGENT TRANSPORT SYSTEMS, 2020, 14 (14) : 2141 - 2150
  • [18] Deep Reinforcement Learning Based Resource Allocation Strategy in Cloud-Edge Computing System
    Xu, Zhuohan
    Zhong, Zeheng
    Shi, Bing
    [J]. 2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [19] Deep Reinforcement Learning Based Resource Allocation Strategy in Cloud-Edge Computing System
    Xu, Jianqiao
    Xu, Zhuohan
    Shi, Bing
    [J]. FRONTIERS IN BIOENGINEERING AND BIOTECHNOLOGY, 2022, 10
  • [20] Adaptive Q-learning-supported Resource Allocation Model in Vehicular Fogs
    Hossain, Md Tahmid
    de Grande, Robson E.
    [J]. 2022 27TH IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS (IEEE ISCC 2022), 2022,