Oikonomos-II: A Reinforcement-Learning, Resource-Recommendation System for Cloud HPC

Cited by: 0

Authors:

Betting, J. L. F. [1]

De Zeeuw, C. I. [1,2]

Strydis, C. [1,3]

Affiliations:

[1] Erasmus MC, Dept Neurosci, Rotterdam, Netherlands

[2] Netherlands Inst Neurosci, Amsterdam, Netherlands

[3] Delft Univ Technol, Quantum & Comp Engn Dept, Delft, Netherlands

Source:

2023 IEEE 30th International Conference on High Performance Computing, Data, and Analytics, HiPC 2023 | 2023

Funding:

Dutch Research Council (NWO);

Keywords:

High-Performance Computing; resource recommendation; cloud computing; prediction; middleware;

DOI:

10.1109/HiPC58850.2023.00044

Chinese Library Classification (CLC) number:

TP3 [Computing technology, computer technology];

Discipline classification code:

0812;

Abstract:

The cloud has become a powerful and useful environment for the deployment of High-Performance Computing (HPC) applications, but the large number of available instance types poses a challenge in selecting the optimal platform. Users often do not have the time or knowledge necessary to make an optimal choice. Recommender systems have been developed for this purpose but current state-of-the-art systems either require large amounts of training data, or require running the application multiple times; this is costly. In this work, we propose Oikonomos-II, a resource-recommendation system based on reinforcement learning for HPC applications in the cloud. Oikonomos-II models the relationship between different input parameters, instance types, and execution times. The system does not require any preexisting training data or repeated job executions, as it gathers its own training data opportunistically using user-submitted jobs, employing a variant of the Neural-LinUCB algorithm. When deployed on a mix of HPC applications, Oikonomos-II quickly converged towards an optimal policy. The system eliminates the need for preexisting training data or auxiliary runs, providing an economical, general-purpose, resource-recommendation system for cloud HPC.
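
As a rough illustration of the loop the abstract describes (recommend an instance type, observe the execution time of the user-submitted job, update the model), the sketch below uses plain disjoint LinUCB in pure Python. It is not the paper's Neural-LinUCB variant, whose details are not given in this record. The class `LinUCBArm`, the helpers `recommend` and `run_job_stream`, the feature layout, the instance-type names, and the reward choice (negative execution time) are all assumptions made for illustration.

```python
import math


class LinUCBArm:
    """Ridge-regression state for one (hypothetical) instance type."""

    def __init__(self, dim):
        # Inverse of the design matrix A; A starts as the identity (ridge prior).
        self.A_inv = [[1.0 if i == j else 0.0 for j in range(dim)]
                      for i in range(dim)]
        self.b = [0.0] * dim

    def _a_inv_times(self, x):
        return [sum(a * xi for a, xi in zip(row, x)) for row in self.A_inv]

    def ucb(self, x, alpha):
        """Predicted reward plus an exploration bonus scaled by alpha."""
        a_inv_x = self._a_inv_times(x)
        # theta = A_inv b, and A_inv is symmetric, so theta . x = b . (A_inv x).
        mean = sum(bi * v for bi, v in zip(self.b, a_inv_x))
        var = sum(xi * v for xi, v in zip(x, a_inv_x))
        return mean + alpha * math.sqrt(max(var, 0.0))

    def update(self, x, reward):
        """Rank-one update of A_inv (Sherman-Morrison) and of b."""
        a_inv_x = self._a_inv_times(x)
        denom = 1.0 + sum(xi * v for xi, v in zip(x, a_inv_x))
        n = len(x)
        for i in range(n):
            for j in range(n):
                self.A_inv[i][j] -= a_inv_x[i] * a_inv_x[j] / denom
        for i in range(n):
            self.b[i] += reward * x[i]


def recommend(arms, x, alpha=1.0):
    """Return the name of the arm with the highest upper confidence bound."""
    return max(arms, key=lambda name: arms[name].ucb(x, alpha))


def run_job_stream(arms, jobs, runtime_fn, alpha=1.0):
    """Learn opportunistically: one recommendation and one update per job.

    jobs is a list of feature vectors (assumed derived from input parameters);
    runtime_fn(name, x) stands in for actually running the job in the cloud.
    """
    choices = []
    for x in jobs:
        name = recommend(arms, x, alpha)
        runtime = runtime_fn(name, x)
        arms[name].update(x, -runtime)  # assumed reward: shorter is better
        choices.append(name)
    return choices
```

A caller would build one arm per instance type, for example `arms = {"small": LinUCBArm(2), "large": LinUCBArm(2)}`, and pass each incoming job's feature vector through `run_job_stream`. Because every observation comes from a job the user wanted to run anyway, no separate profiling runs are needed in this sketch, which mirrors the opportunistic data gathering the abstract mentions.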


Pages: 266-276

Page count: 11

