Integrating recurrent neural networks and reinforcement learning for dynamic service composition

被引:25
|
作者
Wang, Hongbing [1 ,2 ]
Li, Jiajie [1 ,2 ]
Yu, Qi [3 ]
Hong, Tianjing [1 ,2 ]
Yan, Jia [1 ,2 ]
Zhao, Wei [1 ,2 ]
机构
[1] Southeast Univ, Sch Comp Sci & Engn, SIPAILOU 2, Nanjing 210096, Peoples R China
[2] Southeast Univ, Key Lab Comp Network & Informat Integrat, SIPAILOU 2, Nanjing 210096, Peoples R China
[3] Rochester Inst Tech, Coll Comp & Informat Sci, Rochester, NY USA
基金
美国国家科学基金会;
关键词
Service composition; QoS prediction; Recurrent neural network; Reinforcement learning; MODEL;
D O I
10.1016/j.future.2020.02.030
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In the service oriented architecture (SOA), software and systems are abstracted as web services to be invoked by other systems. Service composition is a technology, which builds a complex system by combining existing simple services. With the development of SOA and web service technology, massive web services with the same function begin to spring up. These services are maintained by different organizations and have different QoS (Quality of Service). Thus, how to choose the appropriate service to make the whole system to deliver the best overall QoS has become a key problem in service composition research. Furthermore, because of the complexity and dynamics of the network environment, QoS may change over time. Therefore, how to adjust the composition system dynamically to adapt to the changing environment and ensure the quality of the composed service also poses challenges. To address the above challenges, we propose a service composition approach based on QoS prediction and reinforcement learning. Specifically, we use a recurrent neural network to predict the QoS, and then make dynamic service selection through reinforcement learning. This approach can be well adapted to a dynamic network environment. We carry out a series of experiments to verify the effectiveness of our approach. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页码:551 / 563
页数:13
相关论文
共 50 条
  • [1] Reinforcement learning of dynamic behavior by using recurrent neural networks
    Ahmet Onat
    Hajime Kita
    Yoshikazu Nishikawa
    [J]. Artificial Life and Robotics, 1997, 1 (3) : 117 - 121
  • [2] Stable reinforcement learning with recurrent neural networks
    Knight J.N.
    Anderson C.
    [J]. Journal of Control Theory and Applications, 2011, 9 (3): : 410 - 420
  • [3] Stable reinforcement learning with recurrent neural networks
    James Nate KNIGHT
    Charles ANDERSON
    [J]. Control Theory and Technology, 2011, 9 (03) : 410 - 420
  • [4] Fuzzy inference-based reinforcement learning of dynamic recurrent neural networks
    Jun, HB
    Lee, DW
    Kim, DJ
    Sim, KB
    [J]. SICE '97 - PROCEEDINGS OF THE 36TH SICE ANNUAL CONFERENCE, INTERNATIONAL SESSION PAPERS, 1997, : 1083 - 1088
  • [5] Deep Reinforcement Learning With Bidirectional Recurrent Neural Networks for Dynamic Spectrum Access
    Chen, Peng
    Quo, Shizeng
    Gao, Yulong
    [J]. 2021 IEEE 94TH VEHICULAR TECHNOLOGY CONFERENCE (VTC2021-FALL), 2021,
  • [6] Integrating Gaussian Process with Reinforcement Learning for Adaptive Service Composition
    Wang, Hongbing
    Wu, Qin
    Chen, Xin
    Yu, Qi
    [J]. SERVICE-ORIENTED COMPUTING, (ICSOC 2015), 2015, 9435 : 203 - 217
  • [7] Integrating reinforcement learning and skyline computing for adaptive service composition
    Wang, Hongbing
    Hu, Xingguo
    Yu, Qi
    Gu, Mingzhu
    Zhao, Wei
    Yan, Jia
    Hong, Tianjing
    [J]. INFORMATION SCIENCES, 2020, 519 : 141 - 160
  • [8] Reinforcement Learning via Recurrent Convolutional Neural Networks
    Shankar, Tanmay
    Dwivedy, Santosha K.
    Guha, Prithwijit
    [J]. 2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 2592 - 2597
  • [9] Reinforcement Learning of Linking and Tracing Contours in Recurrent Neural Networks
    Brosch, Tobias
    Neumann, Heiko
    Roelfsema, Pieter R.
    [J]. PLOS COMPUTATIONAL BIOLOGY, 2015, 11 (10)
  • [10] Knowledge-based recurrent neural networks in reinforcement learning
    Le, Tien Dung
    Komeda, Takashi
    Takagi, Motoki
    [J]. PROCEDINGS OF THE 11TH IASTED INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, 2007, : 169 - 174