Low-cost fault-tolerance protocol for large-scale network monitoring

被引:0
|
作者
Ahn, J
Min, SG
Choi, YI
Lee, BS
机构
[1] Kyonggi Univ, Coll Informat Sci, Dept Comp Sci, Suwonsi Kyonggido 442760, South Korea
[2] Korea Univ, Dept Comp Sci & Engn, Seoul 136701, South Korea
[3] Elect & Telecommun Res Inst, Network Technol Lab, Taejon 305600, South Korea
关键词
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Distributed hierarchical network monitoring model has been proposed to solve scalability problem of centralized model. In this distributed model, a top-level monitoring manager, called main manager, obtains aggregate management information from mid-level managers, named domain managers, forming a hierarchical structure. However, if some of monitoring managers crash, network elements cannot be continuously and correctly monitored until the managers are repaired. To address this important, but previously unresolved issue, this paper presents a new fault-tolerance protocol for domain managers, named DMFTP, allowing the managers to efficiently utilize their organization structure. Therefore, this protocol can minimize failure detection overhead and the number of live managers affected by each manager node crash. Also, it tolerates concurrent manager failures and, after the failed managers have been repaired, ensures their immediate and consistent recovery.
引用
收藏
页码:504 / 513
页数:10
相关论文
共 50 条
  • [1] Low-cost fault-tolerance in barrier synchronizations
    Kulkarni, SS
    Arora, A
    [J]. 1998 INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING - PROCEEDINGS, 1998, : 132 - 139
  • [2] Enhancing fault-tolerance of large-scale MPI scientific applications
    Rodriguez, G.
    Gonzalez, P.
    Martin, M. J.
    Tourino, J.
    [J]. PARALLEL COMPUTING TECHNOLOGIES, PROCEEDINGS, 2007, 4671 : 153 - 161
  • [3] Evaluating low-cost fault-tolerance mechanism for microprocessors on multimedia applications
    Sato, T
    Arita, I
    [J]. 2001 PACIFIC RIM INTERNATIONAL SYMPOSIUM ON DEPENDABLE COMPUTING, PROCEEDINGS, 2001, : 225 - 232
  • [4] EPIPE: A low-cost fault-tolerance technique considering WCET constraints
    Li, Jianli
    Xue, Jingling
    Xie, Xinwei
    Wan, Qing
    Tan, Qingping
    Tan, Lanfang
    [J]. JOURNAL OF SYSTEMS ARCHITECTURE, 2013, 59 (10) : 1383 - 1393
  • [5] Replication-Based Fault-Tolerance for Large-Scale Graph Processing
    Chen, Rong
    Yao, Youyang
    Wang, Peng
    Zhang, Kaiyuan
    Wang, Zhaoguo
    Guan, Haibing
    Zang, Binyu
    Chen, Haibo
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2018, 29 (07) : 1621 - 1635
  • [6] Replication-based Fault-tolerance for Large-scale Graph Processing
    Wang, Peng
    Zhang, Kaiyuan
    Chen, Rong
    Chen, Haibo
    Guan, Haibing
    [J]. 2014 44TH ANNUAL IEEE/IFIP INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS (DSN), 2014, : 562 - 573
  • [7] A Low-cost Sensor Platform for Large-scale Wideband Spectrum Monitoring
    Calvo-Palomino, Roberto
    Pfammatter, Damian
    Giustiniano, Domenico
    Lenders, Vincent
    [J]. IPSN'15: PROCEEDINGS OF THE 14TH INTERNATIONAL SYMPOSIUM ON INFORMATION PROCESSING IN SENSOR NETWORKS, 2015, : 396 - 397
  • [8] Low-cost fault-tolerance for mobile nodes in mobile IP based systems
    Ahn, J
    Hwang, C
    [J]. 21ST INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS WORKSHOPS, PROCEEDINGS, 2001, : 508 - 513
  • [9] Wipi: A Low-Cost Large-Scale Remotely-Accessible Network Testbed
    Attaby, Abdelhamid
    Osman, Nada
    Elnainay, Mustafa
    Youssef, Moustafa
    [J]. IEEE ACCESS, 2019, 7 : 167795 - 167814
  • [10] A protocol to establish low-cost floating treatment wetlands for large-scale wastewater reclamation
    Arslan, Muhammad
    Iqbal, Samina
    Islam, Ejazul
    El-Din, Mohamed Gamal
    Afzal, Muhammad
    [J]. STAR PROTOCOLS, 2023, 4 (04):