Oikonomos-II: A Reinforcement-Learning, Resource-Recommendation System for Cloud HPC

Authors
Betting, J. L. F. [1]
De Zeeuw, C. I. [1, 2]
Strydis, C. [1, 3]
Affiliations
[1] Erasmus MC, Dept Neurosci, Rotterdam, Netherlands
[2] Netherlands Inst Neurosci, Amsterdam, Netherlands
[3] Delft Univ Technol, Quantum & Comp Engn Dept, Delft, Netherlands
Funding
Dutch Research Council
Keywords
High-Performance Computing; resource recommendation; cloud computing; prediction; middleware
DOI
10.1109/HiPC58850.2023.00044
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
The cloud has become a powerful and useful environment for the deployment of High-Performance Computing (HPC) applications, but the large number of available instance types poses a challenge in selecting the optimal platform. Users often do not have the time or knowledge necessary to make an optimal choice. Recommender systems have been developed for this purpose but current state-of-the-art systems either require large amounts of training data, or require running the application multiple times; this is costly. In this work, we propose Oikonomos-II, a resource-recommendation system based on reinforcement learning for HPC applications in the cloud. Oikonomos-II models the relationship between different input parameters, instance types, and execution times. The system does not require any preexisting training data or repeated job executions, as it gathers its own training data opportunistically using user-submitted jobs, employing a variant of the Neural-LinUCB algorithm. When deployed on a mix of HPC applications, Oikonomos-II quickly converged towards an optimal policy. The system eliminates the need for preexisting training data or auxiliary runs, providing an economical, general-purpose, resource-recommendation system for cloud HPC.
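As an illustration of the bandit-style learning loop the abstract describes, the following is a minimal sketch of plain (disjoint) LinUCB, a simpler relative of Neural-LinUCB. It is not the authors' implementation: Neural-LinUCB applies a LinUCB-style rule on top of features learned by a neural network, whereas this sketch uses raw features directly. The feature vector (a bias term plus one normalized input parameter), the three instance types, their synthetic runtime models, and the reward (negative runtime) are all assumptions made for illustration only.

```python
import math
import random


class LinUCBArm:
    """One arm of disjoint LinUCB (here: one hypothetical instance type)."""

    def __init__(self, dim, ridge=1.0):
        # A_inv is the inverse of (ridge * I + sum of x x^T); b is the sum of reward * x.
        self.A_inv = [[(1.0 / ridge if i == j else 0.0) for j in range(dim)]
                      for i in range(dim)]
        self.b = [0.0] * dim

    def _matvec(self, v):
        return [sum(row[j] * v[j] for j in range(len(v))) for row in self.A_inv]

    def ucb(self, x, alpha):
        theta = self._matvec(self.b)           # ridge-regression estimate A^{-1} b
        a_inv_x = self._matvec(x)              # A^{-1} x
        mean = sum(t * xi for t, xi in zip(theta, x))
        width = math.sqrt(max(sum(a * xi for a, xi in zip(a_inv_x, x)), 0.0))
        return mean + alpha * width            # optimistic score

    def update(self, x, reward):
        # Sherman-Morrison rank-one update of A^{-1} after adding x x^T to A.
        a_inv_x = self._matvec(x)
        denom = 1.0 + sum(a * xi for a, xi in zip(a_inv_x, x))
        n = len(x)
        for i in range(n):
            for j in range(n):
                self.A_inv[i][j] -= a_inv_x[i] * a_inv_x[j] / denom
        for i in range(n):
            self.b[i] += reward * x[i]


def recommend(arms, x, alpha=1.0):
    """Return the index of the arm with the highest upper confidence bound."""
    scores = [arm.ucb(x, alpha) for arm in arms]
    return max(range(len(arms)), key=scores.__getitem__)


def simulate(n_jobs=300, seed=0):
    """Learn from a stream of synthetic 'user-submitted' jobs, one run per job."""
    rng = random.Random(seed)
    # Hypothetical ground truth: runtime = base + slope * size for each instance type.
    truth = [(0.2, 3.0), (1.0, 1.0), (2.0, 2.5)]
    arms = [LinUCBArm(dim=2) for _ in truth]
    choices = []
    for _ in range(n_jobs):
        size = rng.uniform(0.0, 1.0)           # assumed normalized input parameter
        x = [1.0, size]                        # bias term plus the parameter
        k = recommend(arms, x, alpha=0.5)
        base, slope = truth[k]
        runtime = base + slope * size + rng.gauss(0.0, 0.05)
        arms[k].update(x, -runtime)            # assumed reward: negative runtime
        choices.append((size, k))
    return arms, choices


if __name__ == "__main__":
    arms, _ = simulate()
    for size in (0.05, 0.5, 0.95):
        print(size, recommend(arms, [1.0, size], alpha=0.0))
```

Each job is executed exactly once on the recommended instance type, and the observed outcome is the only training signal, which mirrors the abstract's point that no preexisting training data or repeated executions are needed. The exploration weight `alpha` and the ridge term are tuning choices in this sketch, not values taken from the paper.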
Pages: 266-276
Page count: 11