Integrating recurrent neural networks and reinforcement learning for dynamic service composition

被引：25

作者：

Wang, Hongbing ^{[1
,2
]}

Li, Jiajie ^{[1
,2
]}

Yu, Qi ^{[3
]}

Hong, Tianjing ^{[1
,2
]}

Yan, Jia ^{[1
,2
]}

Zhao, Wei ^{[1
,2
]}

机构：

[1] Southeast Univ, Sch Comp Sci & Engn, SIPAILOU 2, Nanjing 210096, Peoples R China

[2] Southeast Univ, Key Lab Comp Network & Informat Integrat, SIPAILOU 2, Nanjing 210096, Peoples R China

[3] Rochester Inst Tech, Coll Comp & Informat Sci, Rochester, NY USA

来源：

FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE | 2020年 / 107卷

基金：

美国国家科学基金会;

关键词：

Service composition; QoS prediction; Recurrent neural network; Reinforcement learning; MODEL;

D O I：

10.1016/j.future.2020.02.030

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

In the service oriented architecture (SOA), software and systems are abstracted as web services to be invoked by other systems. Service composition is a technology, which builds a complex system by combining existing simple services. With the development of SOA and web service technology, massive web services with the same function begin to spring up. These services are maintained by different organizations and have different QoS (Quality of Service). Thus, how to choose the appropriate service to make the whole system to deliver the best overall QoS has become a key problem in service composition research. Furthermore, because of the complexity and dynamics of the network environment, QoS may change over time. Therefore, how to adjust the composition system dynamically to adapt to the changing environment and ensure the quality of the composed service also poses challenges. To address the above challenges, we propose a service composition approach based on QoS prediction and reinforcement learning. Specifically, we use a recurrent neural network to predict the QoS, and then make dynamic service selection through reinforcement learning. This approach can be well adapted to a dynamic network environment. We carry out a series of experiments to verify the effectiveness of our approach. (C) 2020 Elsevier B.V. All rights reserved.

引用

页码：551 / 563

页数：13

共 50 条

[1] Reinforcement learning of dynamic behavior by using recurrent neural networks
Ahmet Onat
Hajime Kita
Yoshikazu Nishikawa
[J]. Artificial Life and Robotics, 1997, 1 (3) : 117 - 121
[2] Stable reinforcement learning with recurrent neural networks
Knight J.N.
Anderson C.
[J]. Journal of Control Theory and Applications, 2011, 9 (3): : 410 - 420
[3] Stable reinforcement learning with recurrent neural networks
James Nate KNIGHT
Charles ANDERSON
[J]. Control Theory and Technology, 2011, 9 (03) : 410 - 420
[4] Fuzzy inference-based reinforcement learning of dynamic recurrent neural networks
Jun, HB
Lee, DW
Kim, DJ
Sim, KB
[J]. SICE '97 - PROCEEDINGS OF THE 36TH SICE ANNUAL CONFERENCE, INTERNATIONAL SESSION PAPERS, 1997, : 1083 - 1088
[5] Deep Reinforcement Learning With Bidirectional Recurrent Neural Networks for Dynamic Spectrum Access
Chen, Peng
Quo, Shizeng
Gao, Yulong
[J]. 2021 IEEE 94TH VEHICULAR TECHNOLOGY CONFERENCE (VTC2021-FALL), 2021,
[6] Integrating Gaussian Process with Reinforcement Learning for Adaptive Service Composition
Wang, Hongbing
Wu, Qin
Chen, Xin
Yu, Qi
[J]. SERVICE-ORIENTED COMPUTING, (ICSOC 2015), 2015, 9435 : 203 - 217
[7] Integrating reinforcement learning and skyline computing for adaptive service composition
Wang, Hongbing
Hu, Xingguo
Yu, Qi
Gu, Mingzhu
Zhao, Wei
Yan, Jia
Hong, Tianjing
[J]. INFORMATION SCIENCES, 2020, 519 : 141 - 160
[8] Reinforcement Learning via Recurrent Convolutional Neural Networks
Shankar, Tanmay
Dwivedy, Santosha K.
Guha, Prithwijit
[J]. 2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 2592 - 2597
[9] Reinforcement Learning of Linking and Tracing Contours in Recurrent Neural Networks
Brosch, Tobias
Neumann, Heiko
Roelfsema, Pieter R.
[J]. PLOS COMPUTATIONAL BIOLOGY, 2015, 11 (10)
[10] Knowledge-based recurrent neural networks in reinforcement learning
Le, Tien Dung
Komeda, Takashi
Takagi, Motoki
[J]. PROCEDINGS OF THE 11TH IASTED INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, 2007, : 169 - 174

← 1 2 3 4 5 →