Scalable Real-Time Monitoring for Distributed Applications

被引:6
|
作者
Yuen, C. -H. Philip [1 ]
Chan, S. -H. Gary [1 ]
机构
[1] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Sino Software Res Inst, Risk Management & Business Intelligence Program, Kowloon, Hong Kong, Peoples R China
关键词
Distributed protocol; real-time network monitoring; peer-to-peer network; proxies; ALGORITHMS;
D O I
10.1109/TPDS.2012.60
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In order to assess service quality of a networked application (such as a streaming session), distributed monitoring servers need to continuously collect application-specific performance metrics in real time. Much of the previous work to address this is to use distributed aggregation tree (DAT) rooted at each monitor. However, this approach often leads to high monitoring delay and network stress. In this paper, we study a highly scalable monitoring network for distributed applications. In the network, there are distributed monitors collecting application performance in two steps: first, client applications report their performance to some proxies by means of a client overlay, and then the proxies report the performance to the distributed monitors using another proxy overlay. We first formulate the problem to construct overlays minimizing monitoring delay. The problem is shown to be NP-hard. Then, we present a simple, efficient, and scalable monitoring algorithm called SMon, which continuously reduces network diameter in real time in a distributed manner. Through simulations and actual experimental measurements with implementation, we show that SMon achieves low monitoring delay, network stress, and protocol overhead for distributed applications.
引用
收藏
页码:2330 / 2337
页数:8
相关论文
共 50 条
  • [1] Scalable Data Gathering for Real-time Monitoring Systems on Distributed Computing
    Kamoshida, Yoshikazu
    Taura, Kenjiro
    [J]. CCGRID 2008: EIGHTH IEEE INTERNATIONAL SYMPOSIUM ON CLUSTER COMPUTING AND THE GRID, VOLS 1 AND 2, PROCEEDINGS, 2008, : 425 - 432
  • [2] Distributed, scalable, dependable real-time systems: Middleware services and applications
    Welch, LR
    Ravindran, B
    Werme, PV
    Masters, MW
    Shirazi, BA
    Shirolkar, PA
    Harrison, RD
    Mills, W
    Do, T
    Lafratta, J
    Anwar, SM
    Sharp, S
    Sergeant, T
    Bilowus, G
    Swick, M
    Hoppel, J
    Caruso, J
    [J]. IPPS/SPDP 1999: 13TH INTERNATIONAL PARALLEL PROCESSING SYMPOSIUM & 10TH SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING, PROCEEDINGS, 1999, : 297 - 301
  • [3] Exploring Scalable, Distributed Real-Time Anomaly Detection for Bridge Health Monitoring
    Moallemi, Amirhossein
    Burrello, Alessio
    Brunelli, Davide
    Benini, Luca
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (18) : 17660 - 17674
  • [4] Scalable, distributed, real-time map generation
    Davies, Jonathan J.
    Beresford, Alastair R.
    Hopper, Andy
    [J]. IEEE PERVASIVE COMPUTING, 2006, 5 (04) : 47 - 54
  • [5] Monitoring distributed real-time systems
    [J]. Shiyou Hiagong Gaodeng Xuexiao Xuebao, 1 (71-73, 86):
  • [6] MONITORING DISTRIBUTED REAL-TIME SYSTEMS
    SCHMID, U
    [J]. REAL-TIME SYSTEMS, 1994, 7 (01) : 33 - 56
  • [7] Monitoring Distributed Real-Time Systems
    于波
    [J]. 石油化工高等学校学报, 1998, (01) : 72 - 74+87
  • [8] Scalable Real-Time Analytics for IoT Applications
    Mahmood, Khalid
    Risch, Tore
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON SMART COMPUTING (SMARTCOMP 2021), 2021, : 404 - 406
  • [9] Synchronization for distributed real-time applications
    Kshemkalyani, AD
    [J]. PROCEEDINGS OF THE JOINT WORKSHOP ON PARALLEL AND DISTRIBUTED REAL-TIME SYSTEMS: FIFTH INTERNATIONAL WORKSHOP ON PARALLEL AND DISTRIBUTED REAL-TIME SYSTEMS (WPDRTS) AND THE THIRD WORKSHOP ON OBJECT-ORIENTED REAL-TIME SYSTEMS (OORTS), 1997, : 81 - 90
  • [10] A DISTRIBUTED SYSTEM FOR REAL-TIME APPLICATIONS
    FATHI, ET
    BOSSE, E
    CASEAULT, J
    [J]. IEEE MICRO, 1987, 7 (06) : 21 - 28