A fine-grained robust performance diagnosis framework for run-time cloud applications

被引:0
|
作者
Xin, Ruyue [1 ]
Chen, Peng [2 ]
Grosso, Paola [1 ]
Zhao, Zhiming [1 ]
机构
[1] Univ Amsterdam, MultiScale Networked Syst MNS, Amsterdam, Netherlands
[2] Xihua Univ, Sch Comp & Software Engn, Chengdu, Peoples R China
基金
欧盟地平线“2020”;
关键词
Performance diagnosis; Metrics selection; Deep ensemble learning; Causal graph; Fine-grained;
D O I
10.1016/j.future.2024.02.014
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
To maintain the required service quality of time -critical cloud applications, operators must continuously monitor their runtime status, detect potential performance anomalies, and diagnose the root causes of these anomalies effectively. However, existing performance diagnosis methods face challenges such as the need for high -quality labeled data, the low reusability and robustness of performance anomaly detection models, and the absence of real-time fine-grained root cause localization. These challenges make fixing performance issues quickly and developing effective adaptation decisions difficult. We provide a Fine-grained Robust Performance Diagnosis (FIRED) framework to tackle those challenges. The framework offers a metrics selection component to filter noise and improve detection efficiency, an anomaly detection component that assembles several wellselected base models with a deep neural network, and adopts weakly supervised learning considering fewer labels exist in reality. The framework also employs a real-time, fine-grained root cause localization component to locate dependent resource metrics of performance anomalies. Our experiments show that the framework can effectively reduce data noise and achieve the best accuracy and algorithm robustness for performance anomaly detection. In addition, the framework can accurately localize the first root causes, with an average accuracy higher than 0.7 for locating the first four root cause metrics.
引用
收藏
页码:300 / 311
页数:12
相关论文
共 50 条
  • [1] Timely Fine-grained Interference-sensitive Run-time Adaptation of Time-triggered Schedules
    Skalistis, Stefanos
    Kritikakou, Angeliki
    [J]. 2019 IEEE 40TH REAL-TIME SYSTEMS SYMPOSIUM (RTSS 2019), 2019, : 233 - 245
  • [2] Fine-Grained Aging-Induced Delay Prediction Based on the Monitoring of Run-Time Stress
    Vijayan, Arunkumar
    Koneru, Abhishek
    Kiamehr, Saman
    Chakrabarty, Krishnendu
    Tahoori, Mehdi B.
    [J]. IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2018, 37 (05) : 1064 - 1075
  • [3] Geyser-2: The second prototype CPU with fine-grained run-time Power Gating
    Zhao, L.
    Ikebuchi, D.
    Saito, Y.
    Kamata, M.
    Seki, N.
    Kojima, Y.
    Amano, H.
    Koyama, S.
    Hashida, T.
    Umahashi, Y.
    Masuda, D.
    Usami, K.
    Kimura, K.
    Namiki, M.
    Takeda, S.
    Nakamura, H.
    Kondo, M.
    [J]. 2011 16TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC), 2011,
  • [4] Studying users' adaptation to Android's run-time fine-grained access control system
    Andriotis, Panagiotis
    Stringhini, Gianluca
    Sasse, Martina Angela
    [J]. JOURNAL OF INFORMATION SECURITY AND APPLICATIONS, 2018, 40 : 31 - 43
  • [5] Run-time Mechanisms for Fine-Grained Parallelism on Network Processors: the TILEPro64 Experience
    Buono, Daniele
    Mencagli, Gabriele
    [J]. 2014 INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING & SIMULATION (HPCS), 2014, : 55 - 64
  • [6] Low-overhead run-time scheduling for fine-grained acceleration of signal processing systems
    Boutellier, Jani
    Bhattacharyya, Shuvra S.
    Silven, Olli
    [J]. 2007 IEEE WORKSHOP ON SIGNAL PROCESSING SYSTEMS, VOLS 1 AND 2, 2007, : 457 - +
  • [7] Fine-Grained Aging Prediction Based on the Monitoring of Run-Time Stress Using DfT Infrastructure
    Koneru, Abhishek
    Vijayan, Arunkumar
    Chakrabarty, Krishnendu
    Tahoori, Mehdi B.
    [J]. 2015 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER-AIDED DESIGN (ICCAD), 2015, : 51 - 58
  • [8] A design approach for fine-grained run-time power gating using locally extracted sleep signals
    Usami, Kimiyoshi
    Ohkubo, Naoaki
    [J]. PROCEEDINGS 2006 INTERNATIONAL CONFERENCE ON COMPUTER DESIGN, 2007, : 155 - 161
  • [9] A Fine-Grained Performance Model of Cloud Computing Centers
    Khazaei, Hamzeh
    Misic, Jelena
    Misic, Vojislav B.
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2013, 24 (11) : 2138 - 2147
  • [10] A development framework for building fine-grained CORBA® applications
    Acton, D
    Coatta, T
    Phillips, P
    Sample, M
    [J]. ENTERPRISE DISTRIBUTED OBJECT COMPUTING - PROCEEDINGS SECOND INTERNATIONAL WORKSHOP, 1998, : 183 - 193