Research on the Health Diagnosis Module of Large-scale Clusters

Cited by: 0
Authors
Yang, Cong [1]
Du, Wen-long [1]
Affiliations
[1] Chinese Acad Sci, Shenzhen Inst Adv Technol, Cloud Comp Res Ctr, Shenzhen, Peoples R China
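Keywords
cloud computing; decision tree; health diagnosis module; MONITORING-SYSTEM
DOI
Not available
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract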
A large number of low-level performance metrics, including process, virtual machine, and physical machine metrics, can be measured to identify the health status of a node or even a whole cluster. Traditionally, the nodes in a cluster are monitored, and managers must analyze each metric and the alarm messages from monitoring tools to identify the health status of the cluster. However, this process spends too much time on insignificant metrics and is inefficient, because most clusters have hundreds of nodes or more and it is impossible for one manager to check so many metrics on every node. In this work, we demonstrate that time can be saved by simplifying the metric set, scoring each node, and diagnosing node health status with a decision tree. Specifically, this work first experimentally verifies and ranks the degree of relation between node health and different metrics. Second, we collect and score the training set through load-increase testing. Third, we construct a decision tree from the training set. Finally, a health diagnosis module is composed of the preceding process, algorithm, and decision tree. We evaluate the Health Diagnosis Module (HDM) on a normal PC cluster. Experiments show that HDM can precisely diagnose the health status of nodes and clusters with an accuracy of more than 89%.
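The abstract does not give the metric set, the scoring rule, or the tree-building algorithm, so the following is only a minimal sketch of the general idea of diagnosing node health with a decision tree. It assumes a CART-style tree that splits on Gini impurity. The metric names (`cpu_util`, `mem_util`, `io_wait`), the toy samples, and the `cluster_health` aggregation are all hypothetical and are not taken from the paper.

```python
from collections import Counter

# Hypothetical metric names and toy samples, invented for illustration only.
# Each row is ((cpu_util, mem_util, io_wait), label).
FEATURES = ["cpu_util", "mem_util", "io_wait"]
TRAIN = [
    ((0.20, 0.30, 0.02), "healthy"),
    ((0.35, 0.40, 0.05), "healthy"),
    ((0.50, 0.45, 0.04), "healthy"),
    ((0.60, 0.55, 0.08), "healthy"),
    ((0.92, 0.60, 0.10), "unhealthy"),
    ((0.95, 0.85, 0.30), "unhealthy"),
    ((0.70, 0.95, 0.25), "unhealthy"),
    ((0.88, 0.90, 0.40), "unhealthy"),
]


def gini(labels):
    """Gini impurity of a list of class labels."""
    total = len(labels)
    return 1.0 - sum((n / total) ** 2 for n in Counter(labels).values())


def best_split(rows):
    """Return (feature_index, threshold) that lowers weighted Gini, or None."""
    best, best_score = None, gini([label for _, label in rows])
    for f in range(len(rows[0][0])):
        values = sorted({x[f] for x, _ in rows})
        # Candidate thresholds are midpoints between adjacent distinct values.
        for lo, hi in zip(values, values[1:]):
            thr = (lo + hi) / 2
            left = [label for x, label in rows if x[f] <= thr]
            right = [label for x, label in rows if x[f] > thr]
            score = (len(left) * gini(left) + len(right) * gini(right)) / len(rows)
            if score < best_score:
                best, best_score = (f, thr), score
    return best


def build_tree(rows, depth=0, max_depth=3):
    """Build a tree recursively; a leaf is the majority class label."""
    split = best_split(rows) if depth < max_depth else None
    if split is None:
        return Counter(label for _, label in rows).most_common(1)[0][0]
    f, thr = split
    left = [r for r in rows if r[0][f] <= thr]
    right = [r for r in rows if r[0][f] > thr]
    return (f, thr,
            build_tree(left, depth + 1, max_depth),
            build_tree(right, depth + 1, max_depth))


def diagnose(tree, x):
    """Walk the tree for one node's metric vector and return its label."""
    while isinstance(tree, tuple):
        f, thr, left, right = tree
        tree = left if x[f] <= thr else right
    return tree


def cluster_health(tree, nodes):
    """Fraction of nodes diagnosed healthy (a hypothetical cluster-level score)."""
    results = [diagnose(tree, x) for x in nodes]
    return results.count("healthy") / len(results)


if __name__ == "__main__":
    tree = build_tree(TRAIN)
    print(diagnose(tree, (0.25, 0.30, 0.03)))
    print(cluster_health(tree, [(0.25, 0.30, 0.03), (0.97, 0.90, 0.35)]))
```

In a real deployment the training rows would come from scored measurements such as the load-increase tests the abstract mentions, and the reduced metric set would replace the three placeholder features used here.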
Pages: 589-593
Number of pages: 5