Autonomic failure prediction based on manifold learning for large-scale distributed systems

被引:6
|
作者
Lu X. [1 ]
Wang H.-Q. [1 ]
Zhou R.-J. [1 ]
Ge B.-Y. [1 ]
机构
[1] College of Computer Science and Technology, Harbin Engineering University
基金
中国国家自然科学基金;
关键词
autonomic computing; failure prediction; locally linear embedding; manifold learning;
D O I
10.1016/S1005-8885(09)60497-0
中图分类号
学科分类号
摘要
This article investigates autonomic failure prediction in large-scale distributed systems with nonlinear dimensionality reduction to automatically extract failure features. Most existing methods for failure prediction focus on building prediction models or heuristic rules by discovering failure patterns, but the process of feature extraction before failure patterns recognition is rarely considered due to the increasing complexity of modern distributed systems. In this work, a novel performance-centric approach to automate failure prediction is proposed based on manifold learning (ML). In addition, the ML algorithm named supervised locally linear embedding (SLLE) is applied to achieve feature extraction. To generalize the dimensionality reduction mapping, the nonlinear mapping approximation and optimization solution is also proposed. In experimental work a file transfer test bed with fault injection is developed which can gather multilevel performance metrics transparently. Based on the runtime monitoring of these metrics, the SLLE method can automatically predict more than 50 of the central processing unit (CPU) and memory failures, and around 70 of the network failure. © 2010 The Journal of China Universities of Posts and Telecommunications.
引用
收藏
页码:116 / 124
页数:8
相关论文
共 50 条
  • [1] Failure detectors for large-scale distributed systems
    Hayashibara, N
    Cherif, A
    Katayama, T
    21ST IEEE SYMPOSIUM ON RELIABLE DISTRIBUTED SYSTEMS, PROCEEDINGS, 2002, : 404 - 409
  • [2] Large-scale manifold learning
    Talwalkar, Ameet
    Kumar, Sanjiv
    Rowley, Henry
    2008 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-12, 2008, : 2554 - +
  • [3] Large-scale SVD and Manifold Learning
    Talwalkar, Ameet
    Kumar, Sanjiv
    Mohri, Mehryar
    Rowley, Henry
    JOURNAL OF MACHINE LEARNING RESEARCH, 2013, 14 : 3129 - 3152
  • [4] Large-scale SVD and manifold learning
    1600, Microtome Publishing (14):
  • [5] A study of dynamic meta-learning for failure prediction in large-scale systems
    Lan, Zhiling
    Gu, Jiexing
    Zheng, Ziming
    Thakur, Rajeev
    Coghlan, Susan
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2010, 70 (06) : 630 - 643
  • [6] A Large-Scale Manifold Learning Approach for Brain Tumor Progression Prediction
    Loc Tran
    Banerjee, Deb
    Sun, Xiaoyan
    Wang, Jihong
    Kumar, Ashok J.
    Vinning, David
    McKenzie, Frederic D.
    Li, Yaohang
    Li, Jiang
    MACHINE LEARNING IN MEDICAL IMAGING, 2011, 7009 : 265 - +
  • [7] A new approach of failure detection for large-scale distributed systems
    Yu, Xiangzhan
    Yuri, Xiaochun
    DYNAMICS OF CONTINUOUS DISCRETE AND IMPULSIVE SYSTEMS-SERIES B-APPLICATIONS & ALGORITHMS, 2006, 13E : 3182 - 3186
  • [8] An autonomic operating environment for large-scale distributed applications
    Lehman, TJ
    Deen, RG
    Kaufman, JH
    INTEGRATED COMPUTER-AIDED ENGINEERING, 2006, 13 (01) : 81 - 99
  • [9] Towards Detecting Patterns in Failure Logs of Large-Scale Distributed Systems
    Gurumdimma, Nentawe
    Jhumka, Arshad
    Liakata, Maria
    Chuah, Edward
    Browne, James
    2015 IEEE 29TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS, 2015, : 1052 - 1061
  • [10] Structured Manifold Broad Learning System: A Manifold Perspective for Large-Scale Chaotic Time Series Analysis and Prediction
    Han, Min
    Feng, Shoubo
    Chen, C. L. Philip
    Xu, Meiling
    Qiu, Tie
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2019, 31 (09) : 1809 - 1821