Multiclass classification of distributed memory parallel computations

被引:7
|
作者
Whalen, Sean [1 ]
Peisert, Sean [2 ,3 ]
Bishop, Matt [3 ]
机构
[1] Columbia Univ, Dept Comp Sci, New York, NY 10027 USA
[2] Univ Calif Berkeley, Lawrence Berkeley Natl Lab, Berkeley, CA 94720 USA
[3] Univ Calif Davis, Dept Comp Sci, Davis, CA 95616 USA
关键词
Multiclass classification; Bayesian networks; Random forests; Self-organizing maps; High performance computing; Communication patterns; NETWORK MOTIFS;
D O I
10.1016/j.patrec.2012.10.007
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
High Performance Computing (HPC) is a field concerned with solving large-scale problems in science and engineering. However, the computational infrastructure of HPC systems can also be misused as demonstrated by the recent commoditization of cloud computing resources on the black market As a first step towards addressing this, we introduce a machine learning approach for classifying distributed parallel computations based on communication patterns between compute nodes. We first provide relevant background on message passing and computational equivalence classes called dwarfs and describe our exploratory data analysis using self organizing maps. We then present our classification results across 29 scientific codes using Bayesian networks and compare their performance against Random Forest classifiers. These models, trained with hundreds of gigabytes of communication logs collected at Lawrence Berkeley National Laboratory, perform well without any a priori information and address several shortcomings of previous approaches. (C) 2012 Elsevier B.V. All rights reserved.
引用
收藏
页码:322 / 329
页数:8
相关论文
共 50 条
  • [11] Parallel and distributed computations for structural mechanics:: A review
    Bittnar, Z
    Kruis, J
    Nemecek, J
    Patzák, B
    Rypl, D
    CIVIL AND STRUCTURAL ENGINEERING COMPUTING: 2001, 2001, : 211 - 233
  • [12] Distributed object based framework for parallel computations
    Li, Guo-Dong
    Zhang, De-Fu
    Ruan Jian Xue Bao/Journal of Software, 2002, 13 (03): : 342 - 353
  • [13] Parallel and distributed computations in a parameter inverse problem
    Telega, H
    VECTOR AND PARALLEL PROCESSING - VECPAR'96, 1997, 1215 : 183 - 197
  • [14] Parallel Large Scale High Accuracy Navier-Stokes Computations on Distributed Memory Clusters
    S. Peigin
    B. Epstein
    T. Rubin
    S. Seror
    The Journal of Supercomputing, 2004, 27 : 49 - 68
  • [15] Parallel large scale high accuracy Navier-Stokes computations on distributed memory clusters
    Peigin, S
    Epstein, B
    Rubin, T
    Seror, S
    JOURNAL OF SUPERCOMPUTING, 2004, 27 (01): : 49 - 68
  • [16] Privacy Preserving Multiclass Classification for Horizontally Distributed Data
    Lu, Yunmei
    Yan, Mingyuan
    Han, Meng
    Yang, Qingliang
    Zhang, Yanqing
    SIGITE'18: PROCEEDINGS OF THE 19TH ANNUAL SIG CONFERENCE ON INFORMATION TECHNOLOGY EDUCATION, 2018, : 165 - 165
  • [17] Visualizing Distributed Memory Computations with Hive Plots
    Engle, Sophie
    Whalen, Sean
    VIZSEC 2012: PROCEEDINGS OF THE NINTH INTERNATIONAL SYMPOSIUM ON VISUALIZATION FOR CYBER SECURITY, 2012, : 56 - 63
  • [18] ZAKI: A Smart Method and Tool for Automatic Performance Optimization of Parallel SpMV Computations on Distributed Memory Machines
    Sardar Usman
    Rashid Mehmood
    Iyad Katib
    Aiiad Albeshri
    Saleh M. Altowaijri
    Mobile Networks and Applications, 2023, 28 : 744 - 763
  • [19] ZAKI: A Smart Method and Tool for Automatic Performance Optimization of Parallel SpMV Computations on Distributed Memory Machines
    Usman, Sardar
    Mehmood, Rashid
    Katib, Iyad
    Albeshri, Aiiad
    Altowaijri, Saleh M.
    MOBILE NETWORKS & APPLICATIONS, 2023, 28 (02): : 744 - 763
  • [20] Applying parallel/distributed computing to advanced algebraic computations
    Ajwa, IA
    Wang, PS
    PROCEEDINGS OF THE IEEE 1997 AEROSPACE AND ELECTRONICS CONFERENCE - NAECON 1997, VOLS 1 AND 2, 1997, : 156 - 164