A Novel Adaptive Resource Allocation Model Based on SMDP and Reinforcement Learning Algorithm in Vehicular Cloud System

Cited by: 45

Authors:

Liang, Hongbin [1,2]

Zhang, Xiaohui [1,2,3]

Zhang, Jin [1,2]

Li, Qizhen [3]

Zhou, Shuya [1,2]

Zhao, Lian [4]

Affiliations:

[1] Southwest Jiaotong Univ, Natl United Engn Lab Integrated & Intelligent Tra, Sch Transportat & Logist, Chengdu 611756, Sichuan, Peoples R China

[2] Southwest Jiaotong Univ, Natl Engn Lab Integrated Transportat Big Data Ap, Chengdu 611756, Sichuan, Peoples R China

[3] Southwest Jiaotong Univ, Sch Informat Sci & Technol, Chengdu 611756, Sichuan, Peoples R China

[4] Ryerson Univ, Dept Elect Comp & Biomed Engn, Toronto, ON M5B 2K3, Canada

Source:

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY | 2019, Vol. 68, No. 10

Funding:

National Natural Science Foundation of China;

Keywords:

Cloud computing; Resource management; Adaptation models; Adaptive systems; Quality of service; Quality of experience; Computational modeling; Semi-Markov Decision Process (SMDP); Reinforcement Learning (RL) Algorithm; Vehicular Cloud System; Neural-Network; Quality of Experience (QoE); Quality of Service (QoS); ASSIGNMENT; COMMUNICATION; OPTIMIZATION;

DOI:

10.1109/TVT.2019.2937842

Chinese Library Classification:

TM [Electrical engineering]; TN [Electronics and communication technology];

Discipline classification code:

0808 ; 0809 ;

Abstract:

In this paper, we propose a novel adaptive cloud resource allocation model for vehicular cloud systems, based on a Semi-Markov Decision Process (SMDP) and a Reinforcement Learning (RL) algorithm. The problem of adaptive resource allocation for vehicular requests is formulated as an SMDP in order to capture the dynamics of vehicular request arrivals and departures. An optimized decision is made to guarantee the Quality of Service (QoS) of the vehicular cloud system and the Quality of Experience (QoE) of the vehicular users, and to maximize the total system reward of the vehicular cloud system while balancing the vehicular cloud resource expense against the system income. Furthermore, to capture the mobility feature of the vehicular cloud system, we apply a neural-network-based RL algorithm to solve the proposed SMDP-based adaptive cloud resource allocation model. First, we use a planning algorithm to obtain the action values for certain state-action pairs, which serve as the initial samples for training the neural network. RL is then used to update the parameters of the neural network, train it, and adaptively improve the decision strategy. As a result, an adaptive vehicular cloud resource allocation scheme that approaches the optimal strategy is obtained during the RL process without knowledge of the distribution functions of vehicular request arrivals and departures. The simulation results show that the proposed adaptive cloud resource allocation model can reduce the probability of delay in processing requests and achieve higher system rewards than the commonly used greedy resource allocation method. The performance of the RL solution approaches that of the traditional value iteration solution for the proposed model.
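As a rough illustration of the comparison the abstract draws between a value iteration solution and a greedy allocation method, the following minimal sketch solves a toy admission-control problem. It is not the authors' model: it uses a discrete-time MDP rather than a full SMDP, it contains no neural network or RL step, and every name and parameter in it (`ToyCloud`, `K`, `lam`, `mu`, `income`, `hold_cost`, `gamma`) is a hypothetical choice made only for this example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ToyCloud:
    """Hypothetical toy model: n busy resource units out of K (all values assumed)."""
    K: int = 5              # total resource units
    lam: float = 3.0        # request arrival rate
    mu: float = 1.0         # departure rate per busy unit
    income: float = 4.0     # reward for accepting a request
    hold_cost: float = 1.5  # per-step expense per busy unit
    gamma: float = 0.95     # discount factor

    def actions(self, n):
        # 0 = reject the next request, 1 = accept it (only if a unit is free)
        return (0, 1) if n < self.K else (0,)

    def q(self, n, a, V):
        # One-step lookahead value of taking action a in state n.
        total_rate = self.lam + self.K * self.mu
        p_arr = self.lam / total_rate       # next event is an arrival
        p_dep = n * self.mu / total_rate    # next event is a departure
        p_stay = 1.0 - p_arr - p_dep        # no state change this step
        return (-self.hold_cost * n
                + p_arr * (a * self.income + self.gamma * V[n + a])
                + p_dep * self.gamma * V[max(n - 1, 0)]
                + p_stay * self.gamma * V[n])


def solve(model, policy, tol=1e-9):
    """Iterate V <- Q(n, policy(n, V)) until the largest change is below tol."""
    V = [0.0] * (model.K + 1)
    while True:
        new = [model.q(n, policy(n, V), V) for n in range(model.K + 1)]
        if max(abs(x - y) for x, y in zip(new, V)) < tol:
            return new
        V = new


model = ToyCloud()


def greedy(n, V):
    # Greedy baseline: accept whenever a unit is free, ignoring future cost.
    return 1 if n < model.K else 0


def best(n, V):
    # Value iteration: pick the action with the largest lookahead value.
    return max(model.actions(n), key=lambda a: model.q(n, a, V))


v_greedy = solve(model, greedy)
v_opt = solve(model, best)
for n, (g, o) in enumerate(zip(v_greedy, v_opt)):
    print(f"busy={n}  greedy={g:.3f}  value_iteration={o:.3f}")
```

Passing the maximizing policy to `solve` gives ordinary value iteration, while passing `greedy` evaluates the fixed accept-if-possible policy, so the two value lists can be compared state by state. In this sketch the value iteration result is never worse than the greedy one in any state; how large the gap is depends entirely on the assumed parameters.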

Pages: 10018-10029

Number of pages: 12

50 items in total

[11] Content Driven and Reinforcement Learning Based Resource Allocation Scheme in Vehicular Network
Chen, Jiujiu
Guo, Caili
Feng, Chunyan
Zhu, Meiyi
Sun, Qizheng
[J]. IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2021), 2021,
[12] Adaptive Resource Allocation for Anti-money Laundering Based on SMDP
Hong, Xintao
Liang, Hongbin
Gao, Zengan
[J]. WIRELESS ALGORITHMS, SYSTEMS, AND APPLICATIONS, 2015, 9204 : 190 - 200
[13] A Dynamic Resource Allocation Model Based on SMDP and DRL Algorithm for Truck Platoon in Vehicle Network
Liang, Hongbin
Zhou, Shuya
Liu, Xiaobo
Zheng, Fangfang
Hong, Xintao
Zhou, Xuemei
Zhao, Lian
[J]. IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (12) : 10295 - 10305
[14] Resource allocation algorithm for MEC based on Deep Reinforcement Learning
Wang, Yijie
Chen, Xin
Chen, Ying
Du, Shougang
[J]. 2021 IEEE INTERNATIONAL PERFORMANCE, COMPUTING, AND COMMUNICATIONS CONFERENCE (IPCCC), 2021,
[15] Deep reinforcement learning-based joint optimization model for vehicular task offloading and resource allocation
Li, Zhi-Yuan
Zhang, Zeng-Xiang
[J]. PEER-TO-PEER NETWORKING AND APPLICATIONS, 2024, 17 (04) : 2001 - 2015
[16] A Reinforcement Learning-Based Resource Allocation Scheme for Cloud Robotics
Liu, Hang
Liu, Shiwen
Zheng, Kan
[J]. IEEE ACCESS, 2018, 6 : 17215 - 17222
[17] FRAC: a flexible resource allocation for vehicular cloud system
Pradhan, Srikanta
Tripathy, Somanath
[J]. IET INTELLIGENT TRANSPORT SYSTEMS, 2020, 14 (14) : 2141 - 2150
[18] Deep Reinforcement Learning Based Resource Allocation Strategy in Cloud-Edge Computing System
Xu, Zhuohan
Zhong, Zeheng
Shi, Bing
[J]. 2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
[19] Deep Reinforcement Learning Based Resource Allocation Strategy in Cloud-Edge Computing System
Xu, Jianqiao
Xu, Zhuohan
Shi, Bing
[J]. FRONTIERS IN BIOENGINEERING AND BIOTECHNOLOGY, 2022, 10
[20] Adaptive Q-learning-supported Resource Allocation Model in Vehicular Fogs
Hossain, Md Tahmid
de Grande, Robson E.
[J]. 2022 27TH IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS (IEEE ISCC 2022), 2022,
