HephaestusForge: Optimal microservice deployment across the Compute Continuum via Reinforcement Learning

Cited by: 0

Authors:

Santos, Jose [1]

Zaccarini, Mattia [2]

Poltronieri, Filippo [2]

Tortonesi, Mauro [2]

Stefanelli, Cesare [2]

Di Cicco, Nicola [3]

De Turck, Filip [1]

Affiliations:

[1] Univ Ghent, Dept Informat Technol, IDLab, Imec, Technol Pk Zwijnaarde 126, B-9052 Ghent, Belgium

[2] Univ Ferrara, Distributed Syst Res Grp, Ferrara, Italy

[3] Politecn Milan, Dept Elect Informat & Bioengn DEIB, Milan, Italy

Source:

FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE | 2025 / Vol. 166

Keywords:

Kubernetes; Orchestration; Microservices; Reinforcement Learning; Resource allocation; Compute Continuum; SERVICE FUNCTION CHAIN; CLOUD; ORCHESTRATION;

DOI:

10.1016/j.future.2024.107680

Chinese Library Classification (CLC) number:

TP301 [Theory, Methods];

Discipline classification code:

081202;

Abstract:

With the advent of containerization technologies, microservices have revolutionized application deployment by converting old monolithic software into a group of loosely coupled containers, aiming to offer greater flexibility and improve operational efficiency. This transition made applications more complex, consisting of tens to hundreds of microservices. Designing effective orchestration mechanisms remains a crucial challenge, especially for emerging distributed cloud paradigms such as the Compute Continuum (CC). Orchestration across multiple clusters is still not extensively explored in the literature since most works consider single-cluster scenarios. In the CC scenario, the orchestrator must decide the optimal locations for each microservice, deciding whether instances are deployed all together or placed across different clusters, significantly increasing orchestration complexity. This paper addresses orchestration in a containerized CC environment by studying a Reinforcement Learning (RL) approach for efficient microservice deployment in Kubernetes (K8s) clusters, a widely adopted container orchestration platform. This work demonstrates the effectiveness of RL in achieving near-optimal deployment schemes under dynamic conditions, where network latency and resource capacity fluctuate. We extensively evaluate a multi-objective reward function that aims to minimize overall latency, reduce deployment costs, and promote fair distribution of microservice instances, and we compare it against typical heuristic-based approaches. The results from an implemented OpenAI Gym framework, named HephaestusForge, show that RL algorithms achieve minimal rejection rates (as low as 0.002%, 90x less than the baseline Karmada scheduler). Cost-aware strategies result in lower deployment costs (2.5 units), and latency-aware functions achieve lower latency (268-290 ms), improving by 1.5x and 1.3x, respectively, over the best-performing baselines.
HephaestusForge is available in a public open-source repository, allowing researchers to validate their own placement algorithms. This study also highlights the adaptability of the DeepSets (DS) neural network in optimizing microservice placement across diverse multi-cluster setups without retraining. The DS neural network can handle inputs and outputs as arbitrarily sized sets, enabling the RL algorithm to learn a policy not bound to a fixed number of clusters.
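
The set-based property described in the abstract can be illustrated with a minimal sketch. This is not the authors' network or the HephaestusForge code: it is a single DeepSets-style permutation-equivariant layer written in plain Python, with random weights and hypothetical per-cluster features (latency, cost, free CPU), and all function names are assumptions made for the example. It only shows why one set of weights can score any number of clusters.

```python
import math
import random


def matvec(row, weights):
    """Multiply a feature row (length d_in) by a d_in x d_out weight matrix."""
    return [sum(r * w for r, w in zip(row, col)) for col in zip(*weights)]


def equivariant_layer(rows, w_self, w_pool):
    """DeepSets-style layer: per-element transform plus a pooled set summary.

    rows holds one feature vector per cluster. The mean over the set is
    order-independent, so the layer works for any number of clusters.
    """
    n = len(rows)
    mean = [sum(col) / n for col in zip(*rows)]
    pooled = matvec(mean, w_pool)
    return [
        [math.tanh(a + b) for a, b in zip(matvec(row, w_self), pooled)]
        for row in rows
    ]


def cluster_scores(rows, params):
    """Return one placement score per cluster for a set of any size."""
    hidden = equivariant_layer(rows, params["w_self"], params["w_pool"])
    return [sum(h * w for h, w in zip(row, params["w_out"])) for row in hidden]


def init_params(d_in, d_hidden, seed=0):
    """Random (untrained) weights; a real policy would learn these via RL."""
    rng = random.Random(seed)

    def mat(r, c):
        return [[rng.gauss(0, 1) for _ in range(c)] for _ in range(r)]

    return {
        "w_self": mat(d_in, d_hidden),
        "w_pool": mat(d_in, d_hidden),
        "w_out": [rng.gauss(0, 1) for _ in range(d_hidden)],
    }


# Hypothetical features per cluster: (latency, cost, free CPU), all in [0, 1].
params = init_params(d_in=3, d_hidden=8)
data_rng = random.Random(1)
four_clusters = [[data_rng.random() for _ in range(3)] for _ in range(4)]
seven_clusters = [[data_rng.random() for _ in range(3)] for _ in range(7)]

scores_4 = cluster_scores(four_clusters, params)   # 4 scores
scores_7 = cluster_scores(seven_clusters, params)  # 7 scores, same weights
```

The same `params` produce scores for both the four-cluster and the seven-cluster set, and reordering the clusters reorders the scores in the same way; this is the property that would let a policy trained on one multi-cluster setup be applied to another without retraining.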

Pages: 16
