Extendable MQTT Broker for Feedback-based Resource Management in Large-scale Computing Environments

被引:0
|
作者
Ouchi, Ryo [1 ]
Sakamoto, Ryuichi [1 ]
机构
[1] Tokyo Inst Technol, Tokyo, Japan
来源
PROCEEDINGS OF THE 7TH ASIA-PACIFIC WORKSHOP ON NETWORKING, APNET 2023 | 2023年
关键词
D O I
10.1145/3600061.3603129
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
High-performance computing (HPC) systems demand continuous monitoring to ensure efficient resource allocation and application performance. Recent studies indicate that real-time resource utilization monitoring can significantly improve the performance of dynamic scheduling algorithms. However, latency induced by protocol stack heavily impacts the effectiveness of dynamic scheduling. In this paper, we propose a novel monitoring system that implements the protocol stack on a Field-Programmable Gate Array (FPGA) and adopts a publish/subscribe (pub/sub) communication protocol. Specifically, by introducing an FPGA-based protocol stack, we substantially reduce the latency of protocol stack processing and enable the implementation of custom plugins at the L7 layer. Our experiments demonstrate that the proposed system effectively reduces protocol stack latency and, with the extensibility provided by user-defined plugins, offers great potential for a wide range of HPC monitoring and feedback applications.
引用
收藏
页码:190 / 191
页数:2
相关论文
共 50 条
  • [41] Video Management and Resource Allocation for a Large-Scale VoD Cloud
    Chang, Zhangyu
    Chan, S. -H. Gary
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2016, 12 (05)
  • [42] GODEL: Unified Large-Scale Resource Management and Scheduling at ByteDance
    Xiang, Wu
    Li, Yakun
    Ren, Yuquan
    Jiang, Fan
    Xin, Chaohui
    Gupta, Varun
    Xiang, Chao
    Song, Xinyi
    Liu, Meng
    Li, Bing
    Shao, Kaiyang
    Xu, Chen
    Shao, Wei
    Fu, Yuqi
    Wang, Wilson
    Xu, Cong
    Xu, Wei
    Lin, Caixue
    Shi, Rui
    Liang, Yuming
    PROCEEDINGS OF THE 2023 ACM SYMPOSIUM ON CLOUD COMPUTING, SOCC 2023, 2023, : 308 - 323
  • [43] Large-Scale Cognitive Cellular Systems: Resource Management Overview
    Guizani, Mohsen
    Khalfi, Bassem
    Ben Ghorbel, Mahdi
    Hamdaoui, Bechir
    IEEE COMMUNICATIONS MAGAZINE, 2015, 53 (05) : 44 - 51
  • [44] Distributed and heuristic policy-based resource management system for large-scale Grids
    Magana, Edgar
    Serrat, Joan
    INTER-DOMAIN MANAGEMENT, PROCEEDINGS, 2007, 4543 : 184 - +
  • [45] Super-service-oriented Architecture in Large-scale Pervasive Computing Environments
    蔡学明
    贺樑
    段新娥
    JournalofDonghuaUniversity(EnglishEdition), 2008, (03) : 269 - 272
  • [46] Anomaly Detection for Data Streams in Large-Scale Distributed Heterogeneous Computing Environments
    Dang, Yue
    Wang, Bin
    Brant, Ryan
    Zhang, Zhiping
    Alqallaf, Maha
    Wu, Zhiqiang
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON CYBER WARFARE AND SECURITY (ICCWS 2017), 2017, : 121 - 130
  • [47] Super-service-oriented architecture in large-scale pervasive computing environments
    Cai, Xue-Ming
    He, Liang
    Duan, Xin-E
    Journal of Donghua University (English Edition), 2008, 25 (03) : 269 - 272
  • [48] Edge Computing and Social Internet of Things for Large-Scale Smart Environments Development
    Cicirelli, Franco
    Guerrieri, Antonio
    Spezzano, Giandomenico
    Vinci, Andrea
    Briante, Orazio
    Iera, Antonio
    Ruggeri, Giuseppe
    IEEE INTERNET OF THINGS JOURNAL, 2018, 5 (04): : 2557 - 2571
  • [49] A Context-aware Service Framework for Large-Scale Ambient Computing Environments
    Satoh, Ichiro
    INTERNATIONAL CONFERENCE ON PERVASIVE SERVICES (ICPS 2009), 2009, : 199 - 207
  • [50] Optimal Network Structuring for Large-Scale WSN with Virtual Broker based Publish/Subscribe
    Liu, Yang
    Seet, Boon-Chong
    PROCEEDINGS OF THE 2017 2ND WORKSHOP ON RECENT TRENDS IN TELECOMMUNICATIONS RESEARCH (RTTR), 2017, : 64 - 68