Extendable MQTT Broker for Feedback-based Resource Management in Large-scale Computing Environments

被引:0
|
作者
Ouchi, Ryo [1 ]
Sakamoto, Ryuichi [1 ]
机构
[1] Tokyo Inst Technol, Tokyo, Japan
来源
PROCEEDINGS OF THE 7TH ASIA-PACIFIC WORKSHOP ON NETWORKING, APNET 2023 | 2023年
关键词
D O I
10.1145/3600061.3603129
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
High-performance computing (HPC) systems demand continuous monitoring to ensure efficient resource allocation and application performance. Recent studies indicate that real-time resource utilization monitoring can significantly improve the performance of dynamic scheduling algorithms. However, latency induced by protocol stack heavily impacts the effectiveness of dynamic scheduling. In this paper, we propose a novel monitoring system that implements the protocol stack on a Field-Programmable Gate Array (FPGA) and adopts a publish/subscribe (pub/sub) communication protocol. Specifically, by introducing an FPGA-based protocol stack, we substantially reduce the latency of protocol stack processing and enable the implementation of custom plugins at the L7 layer. Our experiments demonstrate that the proposed system effectively reduces protocol stack latency and, with the extensibility provided by user-defined plugins, offers great potential for a wide range of HPC monitoring and feedback applications.
引用
收藏
页码:190 / 191
页数:2
相关论文
共 50 条
  • [21] Component-based, problem-solving environments for large-scale scientific computing
    Johnson, C
    Parker, S
    Weinstein, D
    Heffernan, S
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2002, 14 (13-15): : 1337 - 1349
  • [22] Third-Party Broker-Based Resource Management in Mobile Computing
    Xiong, Yong-Hua
    Li, Lei
    Jiang, Ke-Yuan
    Yu, Hong
    JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2016, 20 (02) : 262 - 270
  • [23] Mesh data management in large-scale scientific computing
    Chen, Hong
    Zheng, Winmin
    PROCEEDINGS OF THE THIRD CHINAGRID ANNUAL CONFERENCE, 2008, : 144 - 152
  • [24] A Context Management Architecture for Large-Scale Smart Environments
    Oh, Yoosoo
    Han, Jonghyun
    Woo, Woontack
    IEEE COMMUNICATIONS MAGAZINE, 2010, 48 (03) : 118 - 126
  • [25] Distributed workflow management for large-scale grid environments
    Schneider, J
    Linnert, B
    Burchard, LO
    INTERNATIONAL SYMPOSIUM ON APPLICATIONS AND THE INTERNET , PROCEEDINGS, 2006, : 229 - +
  • [26] Performance Analysis of the FCBSH Algorithm for Large-Scale Heterogeneous Computing Environments
    Du, Xiaoli
    Jiang, Changjun
    Yin, Fei
    JOURNAL OF ALGORITHMS & COMPUTATIONAL TECHNOLOGY, 2009, 3 (04) : 461 - 476
  • [27] A Large-scale Distribution and Deployment of Robot Task Based on MQTT Protocol and ROS
    Wei, Jing
    Shi, Dianxi
    Yan, Bingzheng
    Hu, Yerong
    PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON ELECTRONIC INDUSTRY AND AUTOMATION (EIA 2017), 2017, 145 : 308 - 313
  • [28] A feedback-based combinatorial fair economical double auction resource allocation model for cloud computing
    Singhal, Ritu
    Singhal, Archana
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2021, 115 : 780 - 797
  • [29] INVESTIGATING THE USE OF LARGE-SCALE IMMERSIVE COMPUTING ENVIRONMENTS IN COLLABORATIVE DESIGN
    Rosenberg, Meisha
    Vance, Judy M.
    PROCEEDINGS OF THE ASME INTERNATIONAL DESIGN ENGINEERING TECHNICAL CONFERENCES AND COMPUTERS AND INFORMATION IN ENGINEERING CONFERENCE, 2014, VOL 3, 2014,
  • [30] A semantic service discovery network for large-scale ubiquitous computing environments
    Kang, Saehoon
    Kim, Daewoong
    Lee, Younghee
    Hyun, Soon J.
    Lee, Dongman
    Lee, Ben
    ETRI JOURNAL, 2007, 29 (05) : 545 - 558