Switches for HIRE: Resource Scheduling for Data Center In-Network Computing

被引:17
|
作者
Bloecher, Marcel [1 ]
Wang, Lin [1 ,2 ]
Eugster, Patrick [3 ,4 ]
Schmidt, Max [1 ]
机构
[1] Tech Univ Darmstadt, Darmstadt, Germany
[2] Vrije Univ Amsterdam, Amsterdam, Netherlands
[3] USI Lugano, Lugano, Switzerland
[4] Purdue Univ, W Lafayette, IN 47907 USA
基金
瑞士国家科学基金会; 美国国家科学基金会; 欧洲研究理事会;
关键词
data center; scheduling; in-network computing; heterogeneity; nonlinear resource usage;
D O I
10.1145/3445814.3446760
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The recent trend towards more programmable switching hardware in data centers opens up new possibilities for distributed applications to leverage in-network computing (INC). Literature so far has largely focused on individual application scenarios of INC, leaving aside the problem of coordinating usage of potentially scarce and heterogeneous switch resources among multiple INC scenarios, applications, and users. The traditional model of resource pools of isolated compute containers does not fit an INC-enabled data center. This paper describes HIRE, a Holistic INC-aware Resource managEr which allows for server-local and INC resources to be coordinated in a unified manner. HIRE introduces a novel flexible resource (meta-)model to address heterogeneity, resource interchangeability, and non-linear resource requirements, and integrates dependencies between resources and locations in a unified cost model, cast as a min-cost max-flow problem. In absence of prior work, we compare HIRE against variants of state-of-the-art schedulers retrofitted to handle INC requests. Experiments with a workload trace of a 4000 machine cluster show that HIRE makes better use of INC resources by serving 8- 30% more INC requests, while at the same time reducing network detours by 20%, and reducing tail placement latency by 50%.
引用
收藏
页码:268 / 285
页数:18
相关论文
共 50 条
  • [31] Empowering In-Network Gray Failure Detection with Programmable Switches
    Liu, Hong-Yan
    Zhang, Dong
    Wu, Chun-Ming
    [J]. Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2024, 52 (10): : 3613 - 3622
  • [32] Performant Deployment of a Virtualised Network Functions in a Data Center Environment using Resource Aware Scheduling
    McGrath, Michael J.
    Riccobene, Vincenzo
    Petralia, Guiseppe
    Xilouris, Georgios
    Kourtis, Michail-Alexandros
    [J]. PROCEEDINGS OF THE 2015 IFIP/IEEE INTERNATIONAL SYMPOSIUM ON INTEGRATED NETWORK MANAGEMENT (IM), 2015, : 1131 - 1132
  • [33] IMap: Fast and Scalable In-Network Scanning with Programmable Switches
    Li, Guanyu
    Zhang, Menghao
    Guo, Cheng
    Bao, Han
    Xu, Mingwei
    Hu, Hongxin
    Li, Fenghua
    [J]. PROCEEDINGS OF THE 19TH USENIX SYMPOSIUM ON NETWORKED SYSTEMS DESIGN AND IMPLEMENTATION (NSDI '22), 2022, : 667 - 681
  • [34] IIsy: Hybrid In-Network Classification Using Programmable Switches
    Zheng, Changgang
    Xiong, Zhaoqi
    Bui, Thanh T.
    Kaupmees, Siim
    Bensoussane, Riyad
    Bernabeu, Antoine
    Vargaftik, Shay
    Ben-Itzhak, Yaniv
    Zilberman, Noa
    [J]. IEEE-ACM TRANSACTIONS ON NETWORKING, 2024, 32 (03) : 2555 - 2570
  • [35] SOAR: Minimizing Network Utilization with Bounded In-network Computing
    Segal, Raz
    Avin, Chen
    Scalosub, Gabriel
    [J]. PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON EMERGING NETWORKING EXPERIMENTS AND TECHNOLOGIES, CONEXT 2021, 2021, : 16 - 29
  • [36] Data center systems in the network computing age
    Mori, Nobumasa
    Yoshioka, Masaichiro
    Mori, Hiromichi
    Miyadera, Hiroo
    [J]. Hitachi Review, 1996, 45 (05): : 209 - 214
  • [37] Hybrid in-network computing and distributed learning for large-scale data processing
    Jeon, So-Eun
    Lee, Sun-Jin
    Lee, Il-Gu
    [J]. COMPUTER NETWORKS, 2023, 226
  • [38] Accelerating Convolutional Neural Network Inference in Split Computing: An In-Network Computing Approach
    Lee, Hochan
    Ko, Haneul
    Bae, Chanbin
    Pack, Sangheon
    [J]. 38TH INTERNATIONAL CONFERENCE ON INFORMATION NETWORKING, ICOIN 2024, 2024, : 773 - 776
  • [39] DumbNet: A Smart Data Center Network Fabric with Dumb Switches
    Li, Yiran
    Wei, Da
    Chen, Xiaoqi
    Song, Ziheng
    Wu, Ruihan
    Li, Yuxing
    Jin, Xin
    Xu, Wei
    [J]. EUROSYS '18: PROCEEDINGS OF THE THIRTEENTH EUROSYS CONFERENCE, 2018,
  • [40] When Network Matters: Data Center Scheduling with Network Tasks
    Giroire, F.
    Huin, N.
    Tomassilli, A.
    Perennes, S.
    [J]. IEEE CONFERENCE ON COMPUTER COMMUNICATIONS (IEEE INFOCOM 2019), 2019, : 2278 - 2286