Efficient Classification of Supercomputer Failures Using Neuromorphic Computing

Cited by: 0
Authors
Date, Prasanna [1]
Carothers, Christopher D. [1]
Hendler, James A. [1]
Magdon-Ismail, Malik [1]
Affiliations
[1] Rensselaer Polytech Inst, Dept Comp Sci, Troy, NY 12180 USA
Keywords
Neuromorphic Computing; Deep Learning; Machine Learning; Supercomputer Failures; LARGE-SCALE; DESIGN
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Today's petascale supercomputers comprise tens of thousands of compute nodes. Failures on these massive machines are a growing problem as the time for a single compute node to fail is shrinking. Ideally, the job scheduler would be able to predict node failures ahead of time in order to minimize their impact on overall job throughput. However, due to the tight power constraints of future systems, the online modeling of real-time error data must be accomplished using as little power as possible. To this end, the IBM TrueNorth Neurosynaptic System is used to create a Spiking Neural Network (SNN) model of supercomputer failure data, and the classification accuracy of this model is compared to other Machine Learning (ML) and Deep Learning (DL) techniques. It is observed that the TrueNorth failure classification model yields a training accuracy of 99.41%, a validation accuracy of 98.12%, and a testing accuracy of 99.80%, and outperforms other machine learning and deep learning approaches. Moreover, the TrueNorth SNN consumes five orders of magnitude less power than the other ML/DL approaches during the testing phase. Additionally, it is observed that all ML/DL approaches investigated as part of this study are able to produce accurate models of the supercomputer system failure data.
Pages: 242-249
Page count: 8